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The present invention provides methods for preparing a stool sample in order to screen for the presence of indicators of a disease, 
for example a subpopulation of cancerous or precancerous cells. The methods take advantage of the recognition that cellular debris from 
cancerous and precancerous cells is deposited onto only a longitudinal stripe of stool as the stool is forming in the colon. Accordingly, 
methods of the invention comprise obtaining a representative sample, such as a circumferential or cross-sectional sample of stool in order 
to ensure that any disease indicator, such as cellular debris that is shed by colonic cells, is obtained in the sample. 
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METHODS FOR DISEASE DIAGNOSIS FROM STOOL SAMPLES 

This patent application is a continuation-in-part of U.S. Serial Number 
08/699,678, filed August 14, 1996, the disclosure of which is incorporated herein by 
reference. 

FIELD OF THE INVENTION 

5 This invention relates to methods for disease diagnosis, including the early 

detection of colon cancer in patients, and more particularly to methods for preparing 
stool samples for disease diagnosis, including the detection of colon cancer, so as to 
assure or increase the likelihood that the sample will contain the diagnostically relevant 
information if the patient has a disease, for example a cancerous or precancerous 
10 lesion, and to methods for stool sample analysis. 

BACKGROUND OF THE INVENTION 

Stool samples frequently must be prepared for medical diagnostic analysis. 
Stool samples may be analyzed to help diagnose medical conditions ranging from 
parasitic, bacterial or viral infections to inflammatory bowel disease and colorectal 
15 cancer. 

Colorectal cancer is a leading cause of death in Western society. However, if 
diagnosed early, it may be treated effectively by surgical removal of the cancerous 
tissue. Colorectal cancers originate in the colorectal epithelium and typically are not 
extensively vascularized (and therefore not invasive) during the early stages of 

20 development. Colorectal cancer is thought to result from the clonal expansion of a 
single mutant cell in the epithelial lining of the colon or rectum. The transition to a 
highly vascularized, invasive and ultimately metastatic cancer which spreads 
throughout the body commonly takes ten years or longer. If the cancer is detected 
prior to invasion, surgical removal of the cancerous tissue is an effective cure. 

25 However, colorectal cancer is often detected only upon manifestation of clinical 

symptoms, such as pain and black tarry stool. Generally, such symptoms are present 
only when the disease is well established, often after metastasis has occurred, and the 
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prognosis for the patient is poor, even after surgical resection of the cancerous tissue. 
Early detection of colorectal cancer therefore is important in that detection may 
significantly reduce its morbidity. 

Invasive diagnostic methods such as endoscopic examination allow for direct 
5 visual identification, removal, and biopsy of potentially cancerous growths such as 
polyps. Endoscopy is expensive, uncomfortable, inherently risky, and therefore not a 
practical tool for screening populations to identify those with colorectal cancer. Non- 
invasive analysis of stool samples for characteristics indicative of the presence of 
colorectal cancer or precancer is a preferred alternative for early diagnosis, but no 

1 0 known diagnostic method is available which reliably achieves this goal. 

Current non-invasive diagnostic methods involve assaying stool samples for the 
presence of fecal occult blood or for elevated levels of carcinoembryonic antigen, both 
of which are suggestive of the presence of colorectal cancer. Additionally, recent 
developments in molecular biology provide methods of great potential for detecting the 

1 5 presence of a range of DNA mutations or alterations associated with and indicative of 
the presence of colorectal cancer. The presence of such mutations theoretically can 
be detected in DNA found in stool samples during the early stages of colorectal cancer. 
However, stool comprises cells and cellular debris from the patient, from 
microorganisms, and from food, resulting in a heterogeneous population of cells. This 

20 makes detection of a small, specific subpopulation impossible to detect reliably. 

Stool diagnostic assays for colorectal cancer described in the art typically are 
performed on samples prepared from randomly sampled portions of voided stool. 
However, samples prepared according to such methods do not reproducibly yield 
characteristics indicative of the presence of colorectal cancer or precancer, even when 

25 prepared from stool voided by a patient with colorectal cancer or precancer. There is 
therefore a need in the art for methods for early diagnosis of colorectal cancer or 
precancer that will reproducibly detect characteristics indicative of the presence of 
cancerous or precancerous material in samples prepared from stool voided by a patient 
with colorectal cancer or precancer. Such methods are provided herein. 
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SUMMARY OF THE INVENTION 

It has now been appreciated that cells and cellular debris are shed from colonic 
epithelial cells onto forming stool in a longitudinal "stripe" of material along the length 
of the stool. The shed material is confined to this longitudinal stripe, as shown in 
5 Figure 1 (designated *C"). Based upon this recognition, Applicants teach that stool 
sample preparation for diagnostic testing must include taking a representative sample 
in order to ensure that the sample will contain any cells or cellular debris that was shed 
into the stool as it passed through the colon. Accordingly, methods of the invention 
comprise obtaining at least a representative (e.g. a cross-section or circumferential 

10 surface) portion of stool voided by a patient, and performing an assay to detect in the 
sample the presence of cells or cellular debris shed from epithelial cells lining the colon 
that may be indicative of cancer or precancer. Most often, such cells will be derived 
from a polyp or a cancerous or precancerous lesion at a discrete location along the 
colon. For purposes of the present invention, a precancerous lesion comprises 

15 precancerous cells, and precancerous cells are cells that have a mutation that is 
associated with cancer and which renders such cells susceptible to becoming 
cancerous. As shown in Figure 1, a cross-sectional sample is a sample that contains 
at least a circumferential surface of the stool (or portion of a stool comprising an entire 
cross-sectional portion), as, for example, in a coronal section or a sagittal section. A 

20 sample comprising the surface layer of a stool (or of a cross-section of a stool) also 
contains at least a circumferential surface of the stool. Both cross-sections and 
circumferential surfaces comprise longitudinal stripes of sloughed colonic epithelium, 
and are therefore representative samples. 

In a preferred embodiment, methods of the invention comprise the steps of 

25 obtaining at least a circumferential surface or cross-sectional portion of a stool voided 
by a patient, and performing an assay to detect debris indicative of disease. For 
example, such debris may comprise a clonal subpopulation of cells having one or more 
mutations (for purposes of the present application, a mutation is a deletion, 
substitution, addition, modification, intercalation or rearrangement of DNA). Preferred 

30 methods of the invention comprise detection of characteristics of such transformed 
cells, including, for example, mutations, proteins expressed uniquely or in altered 
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amounts in transformed cells, and blood. Particularly preferred methods of the 
invention comprise obtaining at least a circumferential surface or cross-sectional 
portion of a stool sample, and performing an assay to detect DNA characteristics 
indicative of the presence of a clonal subpopulation of cells in the sample. The clonal 
5 subpopulation may be, for example, a subpopulation of cancerous or precancerous 
cells, having a mutation in, for example, a p53 tumor suppressor gene. Clonal 
subpopulations of cells detected by methods according to the invention are often 
characterized by a massive loss of DNA, resulting in a loss of heterozygosity that 
renders ineffective the gene or genes encompassed by the deletion. Alternative 

1 0 methods of the invention comprise performing an assay to detect the presence of an 
infectious disease, for example debris from a microorganism. 

Methods of the invention also comprise obtaining a representative (/.e. ( cross- 
sectional or circumferential) sample of stool and homogenizing the stool in a buffer, 
such as a buffer comprising a detergent and a proteinase and optionally a DNase 

15 inhibitor. 

In methods according to the invention, an assay performed on at least a 
circumferential surface or cross-sectional portion of stool may be an assay to detect a 
disease when a diagnostic indicator of the disease is incorporated into stool. Methods 
of the invention are useful to detect cellular debris shed from the epithelial lining of the 

20 colon, and from other tissue sources that shed cellular material into the gastrointestinal 
tract and preferably into the colon. Shed material may be from colonic epithelial cells, 
pancreatic cells, or may be indicative of infection (e.g. bacterial or viral nucleic acids or 
proteins). In methods according to the invention, an assay performed on at least a 
circumferential surface or cross-sectional portion of stool may be an assay to detect the 

25 presence of elevated levels of carcinoembryonic antigen shed from cells lining the 
colon. Such an assay may also comprise detecting the presence of occult blood. 
However, methods of the invention preferably comprise an assay wherein the sample is 
exposed to an antibody that specifically binds to a molecule characteristic of cellular 
debris shed from cells comprising a subpopulation of cells having a mutation that is 

30 potentially associated with cancer. 
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Methods of the invention are especially and most preferably useful for detecting 
DNA characteristics indicative of a subpopulation of transformed cells in a 
representative stool sample. The DNA characteristics may be, for example, mutations, 
including loss of heterozygosity, microsatellite instability, and others. An assay for 
5 DNA characteristics in a method of the invention may comprise the step of determining 
whether a difference exists in a number X of a first allele known or suspected to be 
mutated in a subpopulation of cells in a representative stool sample, and a number Y of 
an allele known or suspected not to be mutated in the sample, a statistically-significant 
difference being indicative of a mutation and the possible presence of cancer in a 

10 subpopulation of cells in the sample. In an embodiment of the invention, the difference 
between a number of a tumor suppressor gene and a number of a non-cancer- 
associated gene are compared, a statistically-significant difference in the numbers 
being indicative of a mutation in the tumor suppressor gene. 

Assays useful in the practice of methods according to the invention also include 

1 5 an assay to detect the presence of a deletion or other mutation in a region 

encompassing a polymorphic nucleotide. In such an assay, a number of a polymorphic 
nucleotide present at maternal and paternal alleles, wherein the patient is 
heterozygous for the polymorphic nucleotide, is determined. A statistically significant 
difference between a number of a polymorphic nucleotide in a maternal allele and a 

20 paternal allele is indicative of the presence of a deletion in one of the two alleles. 
Another useful assay is one to detect an infectious disease. 

Methods of the invention typically include, following sample preparation and an 
assay for characteristics of cells or cellular debris, a visual examination of the colon in 
order to determine if a polyp or other lesion is, in fact, present. Finally, surgical 

25 resection of abnormal tissue may be done in order to prevent the spread of cancerous 
or precancerous tissue. 

Accordingly, methods of the invention provide means for screening for the 
presence of a cancerous or precancerous subpopulation of cells in a heterogeneous 
sample, such as a stool sample. Methods of the invention reduce morbidity and 

30 mortality associated with lesions of the colonic epithelium. Moreover, methods of the 
invention comprise more accurate screening methods than are currently available in 
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the art, because current methods take advantage of the observation that cancerous or 
precancerous cells shed debris only onto or into part of the surface of the forming stool. 
The present methods reliably assay over the entire circumference of the stool, thereby 
increasing the likelihood of detecting an abnormality if one exists. Further aspects and 
5 advantages of the invention are contained in the following detailed description thereof. 

DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagram of a cylinder which represents a formed stool and shows 
various cross-sections which will contain material from the entire circumference of a 
stool. The Section labeled "A" is a typical coronal section and the section labeled "B" is 
10 a typical sagittal section. The strip labeled U C represents material shed from 
cancerous tissue which is deposited in a longitudinal stripe. 

Figure 2 is a schematic diagram of a receptacle for containing a stool sample. 

Figure 3 is a schematic diagram of a multi-orifice impedance counter; wherein 
reference numeral 1 indicates the direction of flow through the column; reference 
15 numeral 2 indicates a plunger means for forcing material downward in the column; 
reference numerals 3 and 4 are different-sized hybridization beads; reference numeral 
5 is an optional filter for extracting unwanted particles; reference numeral 6 indicates 
an array of orifices for measuring differential impedance; and reference numeral 7 is a 
collection chamber. 

20 Figure 4 is a diagram showing primers useful for the detection of single base 

polymorphisms. 

DETAILED DESCRIPTION OF THE INVENTION 

Methods according to the present invention are useful for the preparation of 
stool samples that will reproducibly contain cells or cellular debris shed from a clonal 

25 population of cancerous or precancerous cells, if such a population is present at any 
site along the colon of a patient. These samples are then used to perform assays to 
detect characteristics indicative of cancer in a highly-reproducible and accurate way. 
In preferred methods of the invention, a sample representative of the surface of a stool 
is obtained. Such methods provide an improvement over the art inasmuch as they 

30 teach removing the portion of stool onto which the colonic epithelium sheds debris. 
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Two preferred methods for obtaining such a portion are cross-sectioning and extracting 
the surface of the stool. In a more preferred method, the surface of a cross-section is 
obtained. Without the recognition that at least a cross-sectional sample must be 
obtained, there is no means for reproducibly obtaining a sample that will contain a 
5 cancerous or precancerous subpopulation of cells, if one exists. 

Methods described in the art do not recognize that, unlike infection by parasites, 
bacteria and viruses, characteristics indicative of the presence of colon cancer, 
especially early stage colon cancer, are found only in a specific portion of voided stool. 
If the sampled portion of stool does not include the portion that happens to contain 

10 cells and cellular debris shed from early-stage cancer tissue, the diagnostic assay 
necessarily will fail to detect the characteristics indicative of the presence of colorectal 
cancer in a reliable manner even if homogenized, i.e.; will produce a false-negative 
result. Methods of the invention are also useful to detect the presence of bacteria, 
virus or parasites, as they will also be present on the surface of formed stool. 

1 5 Sloughed cells from, for example, a polyp forming in the epithelial lining of the 

colon, or on early stage cancerous lesions, are sloughed onto only the portion of the 
forming stool that comes into contact with the polyp or lesion. Accordingly, in early 
stage disease, only a small portion of the surface layer of the forming stool will contain 
sloughed cells, and if that portion happens not to be taken as part of the sample, an 

20 assay for indicia of colon cancer necessarily will produce a false-negative result. A 
brief review of the anatomy and physiology of the colon will aid in an understanding of 
this phenomenon. 

A typical adult colon is approximately six feet in length, with a diameter of about 
two to three inches. Numerous bends and folds are present throughout its length. The 

25 colon removes water from liquid or semi-liquid waste material that enters the colon, and 
relatively solid stool begins to form in the proximal third of the colon. Epithelial cells 
line the lumen of the colon, and the lumenal surface is organized into microscopic 
crypts. Colorectal epithelial cells are replaced every four to five days. The epithelial 
cells divide rapidly at the base of the crypts and migrate to the apeces, where cells 

30 appear to undergo apoptosis (programmed cell death), and cellular debris is shed into 
the lumen. The lining of the colorectal lumen is elastic and the diameter of the lumen is 
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determined by the volume of stool that is passing through the colon at any given time. 
As a result, the surface of the forming stool passing through the colon is in direct 
contact with the epithelial lining of the lumen. Shed epithelial cells (which may or may 
not have undergone apoptosis) and cellular debris therefore are incorporated onto the 
5 surface of stool as it passes through the colon. 

Cells and cellular debris from colorectal epithelial cancers therefore also are 
shed onto forming stool. Most colorectal cancers develop in regions of the colon where 
stool is relatively solid, indeed approximately one third of such cancers develop in the 
rectum. Markers indicative of the presence of cancer, including cells, cellular debris, 

1 0 DNA, blood, and carcinoembryonic antigen, are shed onto the portion of the forming 
stool that contacts the cancerous tissue as the stool passes through the colon. Since 
the stool is relatively solid, these markers will remain on or near the surface of the stool 
where they were deposited and will not be homogeneously dispersed throughout the 
stool. As stool passes over a cancerous or precancerous growth, material from the 

15 growth will be deposited along the stool, but only on the part of the stool circumference 
that comes into direct contact with the cancerous or precancerous tissue comprising 
the lesion. Stool voided by a patient with colorectal cancer or precancer is therefore 
characterized by a longitudinal "stripe 0 of diagnostically relevant material derived from 
the cancerous or precancerous tissue. 

20 A sample that does not include material from the entire circumference of a stool 

voided by a patient with colorectal cancer or precancer will not reproducibly contain 
material derived from the cancerous or precancerous tissue. Currently, random, non- 
cross-sectional samples ("smears") of voided stool are analyzed in clinical settings. In 
these, sloughed cancerous or precancerous cells and cellular debris have no 

25 possibility of detection unless the sample happens by chance to contain the portion of 
stool which made contact with the region of the colon from which cells were sloughed. 

Furthermore, cancers typically develop by clonal expansion of a single mutant 
cell, and in the early stages of the disease, i.e., when surgical removal is an effective 
cure, the cancerous lesion will be very small and may lie on a small arc of the 

30 circumference of the colon. Material derived from such an early stage cancer therefore 
will be shed onto or into stool in a very narrow stripe (labeled C in Figure 1). 
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Consequently, a sample that does not contain the entire circumference of a stool 
voided by a patient with early stage colorectal cancer or precancer only by chance will 
contain material indicative of the presence of the early stage cancerous or 
precancerous condition. However, early detection of colorectal cancer is very 
important for effective surgical intervention. The present invention provides methods 
for reproducible early detection of characteristics indicative of the presence of cancer 
or precancer in a patient. 

Similarly, the present invention provides methods for reproducibly detecting an 
indicator of the presence of an infectious agent, such as a virus, bacterium, parasite or 
other microorganism, in stool from an infected patient. Methods of the invention are 
particularly useful for reproducibly detecting an infectious agent, or an indicator 
characteristic thereof, that is deposited onto narrow area of stool as it forms in the 
colon. 

Analysis of at least a circumferential surface or cross-sectional sample of stool 
ensures that a disease indicator, incorporated on a stool as it moves through the colon, 
will be present in a diagnostic assay sample. For example, analysis of at least a 
circumferential surface or cross-sectional sample of stool (including a whole stool), as 
shown in Figure 1, ensures that at least a portion of cells and cellular, debris shed from 
any existing cancerous or pre-cancerous cells (even if shed from small early stage 
cancerous or pre-cancerous tissue, e.g., small polyps) will be present in the portion of 
the stool sample to be analyzed. Indeed taking at least a circumferential surface or 
cross-section of the stool sample avoids the possibility of analyzing stool portions that 
will not contain sloughed cancerous or precancerous cells even when the patient has 
colorectal cancer or precancer. 

Once a circumferential surface or cross-sectional sample of stool is obtained, it 
may be homogenized by known methods to distribute cells and cellular debris 
throughout the sample. An assay then is performed on the homogenate, or an extract 
of the homogenate, to detect the presence of cells and/or cellular debris in the sample. 
The assay may be any one or a combination of histological cellular assays, antibody 
based immunoassays (or other formats) designed to detect the presence of a molecule 
characteristic of transformation such as a protein, or DNA-based assays for detecting 
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mutations or genetic characteristics indicative of colorectal cancer. Known assay 
protocols, those disclosed herein or in U.S. Patent No. 5,670,325, or assays hereafter 
developed may be used in the practice of the invention. Non-limiting examples of 
useful known assay protocols include those disclosed in U.S. Patent Nos. 5,137,806 
5 (detection of sequences in selected DNA molecules), 5,348,855 (assay for nucleic acid 
sequences), 5,512,441 (detection of mutant alleles), 5,272,057 and 5,380,645 (RFLP 
analysis), 5,527,676 (detection of p53 gene sequences), 5,330,892 (detection of MCC 
gene sequences), 5,352,775 (detection of APC gene sequences), 5,532,108 (detection 
of DCC gene sequences), and in WO96/08514 (monoclonal antibodies against human 

10 colon carcinoma-associated antigens), the disclosures of each of which are 

incorporated by reference herein. Alternatively, or additionally, an assay for fecal 
occult blood may be performed as reported in U.S. Patent Nos. 4,333,734 and 
5,196,167, incorporated by reference herein. Assays useful in the context of the 
present invention also include an assay for carcinoembryonic antigen as reported in 

15 U.S. Patent No. 5,380,647, incorporated by reference herein. In addition, assays 

described in USSN 08/808,763; 08/815,576; 08/876,857; 08/876,638; 08/877,333; and 
60/063,219; incorporated by reference herein, are also useful in the context of the 
present invention. Finally, the sample may be prepared, as reported in U.S. Patent No. 
4,857,300, incorporated by reference herein, for histological examination to detect 

20 characteristics indicative of the presence of cancerous or precancerous cells. 

The purpose of any assay protocol used in connection with obtaining at least a 
cross-sectional sample is to identify candidates for subsequent invasive diagnostic 
procedure such as colonoscopy or sigmoidoscopy. The assay accordingly need not 
definitively detect the presence of a cancerous or precancerous lesion, although false 

25 negatives obviously are to be avoided. The goal of the test protocol is not to determine 
whether in the vast quantities of cell debris in the sample there are a few cells which 
bear a mutation commonly associated with early stage transformation, but rather 
whether the sample contains debris indicative of clonal expansion of a mutant cellular 
subpopulation. Maximum benefit will come from an assay designed to detect the likely 

30 presence of a clonally expanded cell population, i.e., of transformed colonic epithelial 
cells which comprise the cancerous or precancerous lesions. Assays having the ability 



WO 99/5331* 



PCT/US99/08013 



-11 - 

to detect cancer or precancer in early stages are preferred in methods of the invention. 
Assays using polymerase chain reaction (PCR), restriction fragment length 
polymorphism (RFLP) or other methods for nucleic acid analysis may be used to detect 
known DNA characteristics indicative of the presence of colorectal cancer or 
5 precancer. More precise methods for quantitative detection of cellular debris, such as 
DNA fragments or segments, may be used to analyze cross-sectional samples 
according to methods described herein. 

A preferred assay interrogates the sample for DNA characteristics indicative of 
the development of cancer or precancer. However, assays for use with methods of the 
1 0 invention may detect any abnormal cellular debris shed from clinically-relevant 

transformed tissue. Thus, in accordance with a preferred aspect of the invention, an 
assay is used to detect the presence of characteristics of cells which have experienced 
loss of heterozygosity, microsatellite instability or other mutation. 

The following examples provide details of methods according to the invention. 
15 However, numerous additional aspects of the invention, especially in terms of assays to 
b e performed, will become apparent upon consideration of the following detailed 
description thereof. 
Example 1 

Preparation of a Stool Sample 

20 A sample is prepared such that it contains at least a circumferential surface or 

cross-sectional portion of a stool voided by a patient. 

Stool is voided into a receptacle that is preferably small enough to be 
transported to a testing facility. The receptacle may be fitted to a conventional toilet 
such that the receptacle accepts stool voided in a conventional manner. The 

25 receptacle may comprise a mesh or screen of sufficient size and placement such that 
stool is retained while urine is allowed to pass through the mesh or screen and into the 
toilet. The receptacle additionally may comprise means for removing a circumferential 
surface or cross-sectional portion from the stool. Moreover, the receptacle may 
comprise means for introducing homogenization buffer or one or more preservatives, 

30 such as alcohol, a solution of high salt concentration, antibiotics, and chaotropic salts 
in order to neutralize bacteria present in the stool sample. The homogenization buffer 
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may be a physiologically compatible buffer such as phosphate buffered saline, and may 
comprise salt such as 20-100 mM NaCI or KCL The homogenization buffer may also 
comprise a detergent, such as 1-10% SDS or triton, and/or a proteinase, such as 
proteinase K. The buffer may also contain inhibitors of DNA and RNA degrading 
5 enzymes. 

The receptacle, whether adapted to fit a toilet or simply adapted for receiving the 
voided stool sample, should include sealing means sufficient to contain the voided 
stool sample and any solution added thereto and to prevent the emanation of odors. 
An exemplary receptacle is shown in Figure 2. A shown in that figure, the receptacle 

10 has a support frame 1 which is placed directly over the toilet bowl 2. The support frame 
1 has attached thereto an articulating cover 3 which may be placed in a raised position, 
as shown in figure 2, for depositing of sample or a closed position (not shown) for 
sealing voided stool within the receptacle. The support frame 1 additionally has a 
central opening 4 traversing from a top surface 5 through to a bottom surface 6 of the 

15 support frame 1. The bottom surface 6 directly communicates with a top surface 7 of 
the toilet 2. Extending from the bottom surface 6 of the support frame 1 is a means 8 
for capturing voided stool. Means 8 may be fixedly attached to the support frame 1 or 
may be removably attached for removal subsequent to deposition of stool. Means 8 
may comprise a further means for removing at least a circumferential surface or cross- 

20 sectional portion from the voided stool. A preferable sample size is at least 5-10 g or at 
least 5-10 ml. A means to assess the presence of a minimal sample size may comprise 
a physical diagram indicating the minimal sample size. Alternatively a means to assess 
the presence of a minimal sample size may comprise the displacement of a liquid or of 
a mechanical device to a minimal level upon deposit of the stool sample. 

25 In a preferred embodiment, a cross-sectional portion is removed from the voided 

stool by making one or more sagittal or coronal section through the stool, as shown in 
Figure 1. The removed portion comprises material from an entire circumferential 
surface of the stool. Alternatively, a circumferential surface of a stool is prepared by 
taking the entire surface layer of the stool, or the entire surface layer of a cross- 

30 sectional portion of the stool, in a preferred embodiment, the surface layer is 

approximately 0.001 to 0.1 mm thick. In another preferred embodiment, the surface 
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layer is approximately 0.1 to 20 mm thick. In a more preferred embodiment, the surface 
layer is approximately 5 mm thick. In a most preferred embodiment, the surface layer is 
representative of material deposited on the forming stool, and reproducibly contains 
markers or indicators, such as shed cellular material, of a disease that is present in the 

5 patient. In one embodiment, a stool surface layer is obtained by cutting or scraping off 
the surface of the stool or cross-sectional portion of the stool. In a preferred 
embodiment, a stool surface layer is prepared using a cylindrical cutting device (for 
example a rigid loop) that has a diameter that is smaller than the diameter of the stool 
or cross-sectional portion of the stool. The surface layer is obtained when the stool 

10 sample is forced through the cutting device. In a preferred embodiment, the diameter 
of the cutting device is adjustable. Alternatively, a circumferential surface of a stool is 
obtained by contacting the surface of a stool or cross-sectional portion of a stool with a 
sampling device. In one embodiment, the sampling device is first wrapped around the 
stool sample and subsequently removed or peeled off. The layer of stool that is peeled 

15 off with the sampling device is then processed as described herein. Alternatively, the 
stool sample is rolled along the surface of the sampling device, and the layer of stool 
that adheres to the surface is then processed as described herein. Alternatively, a 
diagnostic assay is performed directly on the layer of material on the sampling device. 
It is important that the entire circumference of the stool is sampled. In a preferred 

20 embodiment, the sampling device has a surface that stool adheres to. In a more 
preferred embodiment, the sampling device surface preferentially binds diagnostic 
markers or indicators of a disease (for example nucleic acid or protein molecules), and 
thereby removes them from the surface of the stool sample. In one embodiment, the 
sampling device is an adherent paper or membrane. In a preferred embodiment, a 

25 stool is frozen prior to processing. In one embodiment the stool is frozen to between -5 
and -30 °C. In a preferred embodiment the cross-sectional portion or circumference of 
the stool is obtained from the frozen or partially thawed stool. In an alternative 
embodiment, a whole stool may be used for diagnostic assays. The stool or portion of 
stool that is obtained contains sufficient material to allow subsequent diagnostic assays 

30 to be performed. 
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Once obtained, the entire circumference or cross-sectional sample of stool is 
homogenized in an appropriate buffer, such as phosphate buffered saline. 
Homogenization means and materials for homogenization are generally known in the 
art. Thus, particular homogenization methods may be selected by the skilled artisan 
5 and may depend upon the assay to be employed. The buffer may contain detergent, 
salt, proteinase, inhibitors of DNA and RNA degrading enzymes. The composition of 
the buffer will depend on the type of assay to be performed. If a fecal occult blood 
assay is to be performed, the buffer may contain chemical compounds which react with 
blood to produce a color, the intensity of which can be measured. Buffers useful for 
10 detecting the presence of fecal occult blood are known in the art. If a test is to be 
performed for the presence of a particular protein, the buffer should not contain a 
proteinase capable of degrading such tumor marking antigens. 

DNA or RNA may be isolated from the homogenate using methods known in the 
art. Subsequent tests may be performed on the isolated DNA and RNA. 

15 Example 2 

Exemplary Enumerative methods for Detection of Colorectal Cancer or 
Precancer in stool samples 

DNA characteristics associated with the presence of colorectal cancer or a 
precancerous lesion may be detected in stool samples prepared according to the 
20 invention, using, for example, the methods described in the following sections. A 
careful endoscopic examination preferably is performed on positive Individuals, 
followed by early surgical excision of any diseased tissue. 

A. Reference-Target 

Methods of the invention are used to prepare a stool sample followed by 
25 detection of a deletion or other mutation in the p53 tumor suppressor gene. The p53 
gene is a good choice because a loss of heterozygosity in p53 is often associated with 
colorectal cancer. An mRNA sequence corresponding to the DNA coding region for 
p53 is reported as GenBank Accession No. M92424. At least a cross-section of a 
voided stool sample is obtained and prepared according to methods of the invention as 
30 described immediately above. The sample need not be further processed for analysis. 
However, DNA or RNA may optionally be isolated from the sample according to 
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methocls known in the art. See, Smith-Ravin, et a/., Gut, 36: 81-86 (1995), 
incorporated by reference herein. 

Nucleic acids may be sheared or cut into small fragments by, for example, 
restriction digestion. The size of nucleic acid fragments produced is not critical, subject 

5 to the limitations described below. A target allele that is suspected of being mutated 
(p53 in this example) and a reference allele are chosen. A reference allele may be any 
allele known normally not to be mutated in colon cancer. 

Either portions of a coding strand or its complement may be detected. For 
exemplification, detection of the coding strand of p53 and reference allele are 

10 described herein. Complement to both p53 and reference allele are removed by 
hybridization to anti-complement oligonucleotide probes (isolation probes) and 
subsequent removal of duplex formed thereby. Methods for removal of complement 
strands from a mixture of single-stranded oligonucleotides are known and include 
techniques such as affinity chromatography. Upon converting double-stranded DNA to 

15 single-stranded DNA [See, e.g., Sambrook, et a/. t Molecular Cloning, A Laboratory 
Manual (1989) incorporated by reference herein], sample is passed through an affinity 
column packed with bound isolation probe that is complementary to the sequence to be 
isolated away from the sample. Conventional column chromatography is appropriate 
for isolation of complement. An affinity column packed with sepharose or other 

20 appropriate materials with attached complementary nucleotides may be used to isolate 
complement DNA in the column, while allowing DNA to be analyzed to pass through the 
column. See Sambrook, Supra. As an alternative, isolation beads may be used to 
exclude complement as discussed in detail below. 

After removal of complement strands, first oligonucleotide probes which 

25 hybridize to at least a portion of the p53 allele and second oligonucleotide probes that 
hybridize to at least a portion of the reference allele are obtained. The probes are 
labeled with a detectable label, such as fluorescein or with detectable particles. 
Distinct labels for the probes are preferred. However, identical labels may be used if, 
for example, sample is assayed in two separate aliquots. Probes may be labeled with 

30 identical or with distinct labels. However, distinct labels are preferred. 
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Labeled probes then are exposed to sample under hybridization conditions. 
Such conditions are well-known in the art. See, e.g., Wallace, et aL, Nucleic Acids 
Res., 6:3543-3557 (1979), incorporated by reference herein. First and Second 
oligonucleotide probes that are distinctly labeled (i.e. with different radioactive 
5 isotopes, fluorescent means, or with beads of different size, See infra) are applied to a 
single aliquot of sample. After exposure of the probes to sample under hybridization 
conditions, sample is washed to remove any unhybridized probe. Thereafter, 
hybridized probes are detected separately for p53 hybrids and reference allele hybrids. 
Standards may be used to establish background and to equilibrate results. Also, if 

10 differential fluorescent labels are used, the number of probes may be determined by 
counting differential fluorescent events in a sample that has been diluted sufficiently to 
enable detection of single fluorescent events in the sample. Duplicate samples may be 
analyzed in order to confirm the accuracy of results obtained. 

If there is a statistically-significant difference between the amount of p53 

15 detected and the amount of the reference allele detected, it may be assumed that a 
mutation has occurred in p53 so as to alter its sequence and prevent hybridization of 
the probe, or that at least a portion of the region of the genome containing p53 has 
been lost in a subpopulation of cells shed from the colon. The patient therefore may be 
at risk for developing or may have developed colon cancer. Statistical significance may 

20 be determined by any known method. See, e.g., Steel, et aL, Principles and 

Procedures of Statistics: A Biomedical Approach (McGraw, Hill, 1980). Statistical 
methods are also outlined in co-owned U.S. Patent No. 5,670,325. 

The determination of a p53 mutation allows a clinician to recommend further 
treatment, such as endoscopy procedures, in order to further diagnose and, if 

25 necessary, treat the patient's condition. The following examples illustrate methods that 
allow direct quantification of hybridization events. 

1. Method for Quantitation of Target and Reference 
Polynucleotides 

Enhanced quantification of binding events between hybridization probes and 
30 target or reference is accomplished by coupling hybridization probes to particles, such 
as beads (hybridization beads). In order to obtain a precise quantitative measure of 
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the amount of a polynucleotide in a sample, hybridization beads are constructed such 
that each bead has attached thereto a single oligonucleotide probe. 

a. Method for Preparation of Probe-Bead Combinations 
A single probe is attached to a bead by incubating a large excess of 
hybridization beads with oligonucleotide probes of a given type (/.a, either first or 
second oligonucleotide probes). Coupling of probe to bead is accomplished using an 
affinity-binding pair. For example, beads may be coated with avidin or streptavidin and 
probes may be labeled with biotin to effect attachment of the probe to the bead. The 
mixture of beads and probes is agitated such that 100% of the probes are bound to a 
bead. The mixture is then exposed to a matrix, such as an affinity column or a 
membrane coated with oligonucleotides that are complementary to the probe. Only 
beads that have an attached probe will adhere to the matrix, the rest being washed 
away. Beads with coupled probe are then released from the matrix by melting 
hybridizations between probe and complement. Multiple exposures to the matrix and 
pre-washing of the column reduces non-specific binding. Moreover, naked beads (/.e. ( 
without attached probe) may be exposed to the matrix to determine a background 
number of beads that can be expected to attach to the matrix in the absence of probe. 

By using a vast excess of beads relative to probe as described above, the vast 
majority of recovered beads will have only one attached probe. For example, if a 
mixture has a ratio of 1 probe to 1000 beads, it is expected that only about 1 bead in a 
million will have two attached probes and even less than one bead in a million will have 
more than two attached probes. Accordingly, hybridization beads are provided in an 
effective 1:1 ratio with probe which allows for precise quantification of target and 
reference polynucleotide as described below. 

For each assay described below, two distinct hybridization beads are used. A 
first hybridization bead has attached thereto a single first oligonucleotide probe that is 
complementary to at least a portion of a target polynucleotide (e.g., a p53 allele). A 
second hybridization bead, of a size distinct from the first hybridization bead, has 
attached thereto a single second oligonucleotide probe that is complementary to at 
least a portion of a reference polynucleotide (/.a, one that is known or suspected not 
to be mutated in the sample). 
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b. Use of Beads to Quantify Target and Reference 
Polynucleotides 

DNA is melted (denatured to form single-stranded DNA) by well-known methods 
See, e.g., Gyllensten, et a/., in Recombinant DNA Methodology II, 565-578 (Wu t ed., 
1995), incorporated by reference herein. One may detect either a coding strand or its 
complement in order to quantify target and/or reference polynucleotide. For purposes 
of illustration, the present example assumes detection of the coding strand. 

2. Removal of Complement 

Single-stranded complement of the target polynucleotide (e.g., p53) and 
reference polynucleotide are removed from the sample by binding to oligonucleotide 
probes that are complementary to target or reference complement. Such probes, 
referred to herein as isolation probes, are attached to isolation beads prior to their 
introduction into the sample. The beads may be magnetized. Thus, when magnetized . 
isolation beads [with attached isolation probe(s)] are introduced into the sample, the 
attached isolation probes hybridize to complement of target or reference (or vice 
versa). Isolation beads preferably are introduced in vast excess in order to saturate 
complement binding. Once hybridization is complete, a magnetic field is applied to the 
sample to draw the magnetized isolation beads (both with and without hybridized 
complement) out of the sample. Assuming that a sufficient quantity of isolation beads 
are introduced into the sample, removal of the isolation beads effectively removes all 
target and reference complement from the sample. 

In an alternative method for complement removal, an excess of oligonucleotide 
probe labeled with biotin is exposed to the melted or dehybridized (single stranded) 
sample under hybridization conditions. Once hybridization is complete, the sample is 
exposed to a column containing immobilized avidin. The biotin-labeled probe, whether 
free or hybridized to complement, is bound by avidin on the column. The remainder of 
the DNA, including target and reference coding strands to be detected, pass through 
the column. In contrast to the description of hybridization beads above, beads for 
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removal of complement may each comprise multiple complementary oligonucleotide 
probes. 

3. Quantitation of Target and Reference 
Two sets of hybridization beads are prepared as described above. Each 
5 member of a first set of hybridization beads (all of which are identical to each other) 
has attached thereto a single oligonucleotide probe that is complementary to at least a 
portion of the target polynucleotide, i.e., the portion of the genome which is altered in 
the cells of a cancerous lesion. Each member of a second set of identical hybridization 
beads (all of which are identical to each other but not to the first set) has attached 

1 0 thereto a single oligonucleotide probe that is complementary to at least a portion of the 
reference polynucleotide, i.e., a portion of the genome which is not likely to be altered ' 
in malignant cells. Members of the second set of hybridization beads are of a size or 
color distinct from that of members of the first set of hybridization beads. First and 
second hybridization beads may also be distinguished on the basis of other 

1 5 characteristics. For example, beads may have fluorescent markers that are 

distinguished by their fluorescence wavelength. Beads with distinct electrochemical 
charges also may be used. The precise modality used for distinguishing beads is not 
essential as long as it is possible to distinguish between first and second probe on the 
basis of distinctions between attached first and second beads. 

20 Both sets of hybridization beads are exposed to the sample under hybridization 

conditions thereby allowing hybridization to reference and target. The sample then is 
washed to remove unhybridized bead/probe combinations. Unhybridized bead/probe 
combinations are removed by, for example, passing the sample through a column lined 
of immobilized DNA complementary to the probe sequence. Thus, any unhybridized 

25 bead/probe combinations are retained on the column while duplex passes through. 
Subsequently, the sample is exposed to means for differentially counting hybridization 
beads in order to quantify first and second hybridization probes which have formed 
duplexes. The numbers obtained provide a precise estimate of the number of copies of 
the reference and target polynucleotide in the population because differential counting 

30 means count individual beads. One bead is equal to one probe which, in turn, signifies 
one copy of the nucleic acid being measured. 
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An example of a differential counting means is an impedance measuring device, 
such as a Coulter counter (Coulter Electronics, Inc., Miami, Florida). Sample is passed 
through the device which differentially detects the two types of hybridization beads by 
measuring their differential impedance of an electric current. Alternatively, the device 
may measure fluorescence, color, or other parameters. In order to increase the speed 
of the assay, a multi-orifice device may be used. A multi-orifice impedance counter is 
shown schematically in figure 2. A multi-orifice array is placed at one end of a column 
filled with an electrically-conductive fluid, such as saline. Hybridization beads with 
either hybridized target or reference segments are inserted at an opposite end of the 
column. Each orifice is large enough to accommodate only one hybridization bead at a 
time and sufficiently wide to allow reliable impedance measurements. A voltage is set 
across each orifice. Each hybridization bead (which is non-conducting), as it passes 
through one of the orifices, displaces a volume of saline, thus creating an impedance 
that is proportional to its size. This, in turn, creates a measurable decrease in current 
that is directly correlated with the size of the bead. By compiling the number of each of 
the two distinct impedance events, a precise estimate of the number of hybridization 
beads and, therefore, the number of probes of each type in the population may be 
obtained. 

Upon quantitative measurement of first and second hybridization beads, the data 
may be analyzed to determine whether any difference between the amounts of first and 
second hybridization beads is statistically significant. A reduction in the amount of 
target relative to the reference is indicative of a mutation in or deletion of the target 
allele in a subpopulation of cells in the sample. Where the p53 gene is the target 
allele, such a mutation is indicative of a cancerous or precancerous condition. A 
clinician may use such results as a basis for prescribing additional treatment, such as 
endoscopy and polypectomy procedures. 

B. Detection of Mutations in Single-base Polymorphisms 

The basic method described above may also be applied to detect a loss of 
heterozygosity or other mutation at a single base polymorphic site between maternal 
and paternal alleles. Such detection is typically an indication of a larger deletion or 
other mutation. However, a mutation at a single polymorphic nucleotide may be all that 
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is necessary to inhibit gene function in one of the two alleles. A mutation in a single- 
base polymorphic region may be difficult to detect due to a recently-discovered 
phenomenon called complementary reduplication. In complementary reduplication, the 
loss of one of two alleles at a particular locus results in "reduplication" of the surviving 
5 allele. Reduplication usually takes place on the chromosome containing the surviving 
allele and involves the production of one or more copies of the surviving allele in close 
proximity on the chromosome to the position of the surviving allele. In the case of a 
locus that displays one or more single-base allelic polymorphisms (/.e., heterozygosity 
at the locus is determined by virtue of one or more single-base differences in one or 

10 more regions of the locus), complementary reduplication results in the insertion on the 
chromosome containing the surviving allele of a duplicate of the sequence 
corresponding to that which was deleted. Even under the most stringent hybridization 
conditions, some of a probe directed against the deleted sequence will bind to the 
reduplicated sequence at a locus of a single-base polymorphism. Accordingly, in such 

15 circumstances, the deletion may not be detected because any true difference in the 
number of probes binding to the polymorphic site (/.e., the allelic region encompassing 
the single-base polymorphism) may be obscured by an increase resulting from the 
other allele's reduplicated region. 

The problems associated with complementary reduplication, and with non- 
20 specific probe binding generally, are alleviated by the practice of the methods 

described herein. Such methods allow detection of a deletion in one of two alleles 
present at a specific locus in a subpopulation of cells contained in a biological sample. 
Numerous alleles, including tumor suppressor alleles, contain single polymorphic 
nucleotides in the context of a constant nucleic acid region. Individuals normally may 
25 be either homozygous or heterozygous for the polymorphic nucleotide. Since 

numerous single-base polymorphic nucleotide sites exist in most alleles, the probability 
that a given individual is heterozygous at least one of the single-base polymorphism 
sites is high. A statistically-significant reduction in one of the two nucleotides at a 
single-base polymorphic site (at which the individual is heterozygous) may be used as 
30 a marker for a deletion in the allele encompassing that site. 
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Genomic regions containing known single-base polymorphisms may be 
identified by reference to a nucleotide database, such as GenBank, EMBL, or any other 
appropriate database. The existence of polymorphisms may be determined by 
methods taught herein, gel electrophoresis or by other standard methods. For 
purposes of the invention, a single-base polymorphism is intended to be a single 
polymorphic nucleotide adjacent to a non-polymorphic region of the allele regardless of 
whether the single polymorphic nucleotide forms part of a larger polymorphic site (i.e. 
the single-base polymorphism may be the terminal nucleotide of a larger, 
polynucleotide polymorphism). For cancer detection, the regions considered are 
regions in which loss of heterozygosity is prevalent, such as regions containing tumor 
suppressor genes. A given individual may be homozygous or heterozygous for the 
polymorphic nucleotide in any identified single-base polymorphic region. Accordingly, 
if a number of single-base polymorphic regions are identified, the probability increases 
that at least one heterozygous single-base polymorphic region is found in a sample. 

Once single-base polymorphic sites are identified, a DNA sample is obtained 
from a patient, e.g., from blood cells, to determine which of those sites is heterozygous 
in normal (i.e., non-cancerous or non pre-cancerous) cells for that individual. Then, a 
stool sample is prepared as described above. Double stranded DNA in the sample is 
converted to single-stranded DNA. Then, either the coding strand or the anti-coding 
strand for both alleles is removed from the sample. As will be evident from the 
following discussion, methods disclosed herein are indifferent as to whether coding 
strand or anti-coding strand is tested. 

An oligonucleotide probe is constructed that is complementary to a portion of the 
region of single-base polymorphism, said portion ending at the nucleotide that is 
immediately 3' to the polymorphic nucleotide, regardless of whether the 5'-3' (coding) 
strand or the 3'-5' (anticoding) strand is used as a template. Figure 3 shows four 
possible probes that are immediately 3' to the polymorphic nucleotide for each of four 
possible template strands as described above (the Sequences in Figure 3 are 
hypothetical and are not intended to represent any actual sequence). The sequence 
labeled M1 is SEQ ID NO:1; the sequence labeled M2 is SEQ ID NO:2; the sequence 
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labeled M3 is SEQ ID NO: 3; the sequence labeled M4 is SEQ ID NO:4; the sequence 
labeled F1 is SEQ ID NO: 5; the sequence labeled F2 is SEQ ID NO: 6; the sequence 
labeled F3 is SEQ ID NO: 7; and the sequence labeled F4 is SEQ ID NO: 8. While 
either strand may be used as a template for probe binding to determine heterozygosity 
and/or the loss thereof, the sequence of the probe that is hybridized to the template will 
be different depending upon the strand used. Probes may be of any length that allows 
efficient and specific hybridization. Figure 3 merely illustrates four hypothetical probes 
that are useful for hybridization to the hypothetical sequence shown. The length of 
probe sequences may be determined as appropriate for each genomic region that is 
analyzed. A preferable length is between about 10 and about 100 nucleotides. The 
size of the probe will also depend upon the size of the region surrounding the single- 
base polymorphism (/.e. t the region 5' or 3' to the next adjacent polymorphism, if any). 
Details concerning the construction and hybridization of oligonucleotide probes are 
known in the art. 

Unique probes for each polymorphic region will hybridize to regions of both 
maternal and paternal alleles up to, but not including, the polymorphic nucleotide, 
which, in a heterozygote, will be different in the maternal and paternal alleles. Figure 3 
shows only a small portion of the region surrounding the polymorphic nucleotide. The 
alleles shown in Figure 3 are heterozygous at the polymorphic site. 

Probe is hybridized to its specific template DNA by standard methods. The 
sample may optionally be washed to remove unhybridized probe. To determine 
whether each target region bound by a probe is heterozygous or homozygous at the 
polymorphic nucleotide, a modification of the dideoxy chain termination method as 
reported in Sanger, Proc. Natl Acad Sti. (USA), 74: 5463-5467 (1977), incorporated 
by reference herein, is used. The method involves using at least two of the four 
common 2\ 3'-dideoxy nucleoside triphosphates (ddATP, ddCTP, ddGTP, and ddTTP). 
A different detectable label is attached to each dideoxy nucleoside triphosphate 
(ddNTP) according to methods known in the art. Differentially-labeled ddNTPs are 
available commercially, for example, from Perkin Elmer Corporation (Cat. No. 401456). 
At least two labeled ddNTPs then are exposed to each sample having probe hybridized 
to maternal and paternal alleles as described above. The choice of which two ddNTPs 
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are used will depend upon the nucleotides at the heterozygous polymorphic site. Any 
3' modified nucleoside triphosphate may be used in the method as long as the 3' 
modification prevents binding of an additional 3' nucleotide (i.e. probe extension) and 
does not inhibit binding of the modified nucleotide to the 3' end of the probe. A DNA 
5 polymerase, such as Sequenase™ (Perkin-Elmer), is added to the sample mixture. 
Using the allelic strands as primer, the polymerase will add one ddNTP to the 3' end of 
the probe, the incorporated ddNTP will be complementary to the nucleotide that exists 
at the single-base polymorphic site. Because the ddNTPs have no 3' hydroxy!, further 
elongation of the hybridized probe will not occur. After completion, the sample is 

10 washed to remove excess ddNTPs. Label is then counted in each sample. The 
presence of two differentially-labeled ddNTPs in a sample is indicative of 
heterozygosity at the polymorphic site. 

It is not necessary to determine the amount of each label present in the sample 
in order to establish heterozygosity or homozygosity. For Example, differentially- 

15 labeled deoxynucleoside triphosphates may be used for a determination of 

heterozygosity or homozygosity. The mere fact that two different labeled dideoxy 
nucleotides are incorporated into the probe means that the single-base polymorphic 
site being analyzed is heterozygous. However, determination of sites at which a 
patient is polymorphic is useful in order to establish a baseline of polymorphisms which 

20 may be used in future tests to detect changes in polymorphic sites which may be 

indicative of cancer. The existence of polymorphisms may be determined by methods 
taught herein, by gel electrophoresis or by other standard methods. 

In the case in which heterozygosity exists at the polymorphic site, counting the 
amount of each of the two differentially-labeled ddNTPs allows a determination of 

25 whether there is a loss of heterozygosity (/.e. f a deletion) in a subpopulation of cells in 
the sample. In a normal (/.e., non-cancerous) sample containing cells that are 
heterozygous at the single-base polymorphic site, it is expected that the detected 
amount of each of the two ddNTPs added to the probe will be identical (within chosen 
limits of statistical significance). However, if a deletion has occurred in one of the two 

30 alleles in a subpopulation of cells in the sample, there will be a statistically-significant 
difference between the amounts of each of the two alleles detected via the incorporated 
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(labeled) ddNTPs. The detection of such a difference is indicative of genomic 
instability within the sample. Such genomic instability indicates the possibility of 
cancerous or pre-cancerous cells in the sample. 

In order to improve the ability to count alleles to which ddNTPs have attached 

5 accurately, ddNTPs are labeled with hybridization-type beads of different sizes as 
described above. Alleles with bound probe comprising a labeled ddNTP are counted 
as described above using a counting device, such as a Coulter counter. Also as 
described above, differential fluorescent labels or other counting means may be used 
to separately detect incorporated ddNTPs. 

1 0 The detection of heterozygosity at single-base polymorphic sites and the 

detection of the loss of heterozygosity may be determined in separate steps. For 
example, probes may be hybridized immediately adjacent to but not including the 
nucleotide determined to be polymorphic as described above. The four ddNTPs may 
then be added to the sample, washed, and the presence or absence of each label may 

15 be detected. Detection of only one label indicates that the individual from whom the 
sample was obtained is homozygous at the site of the potential polymorphic nucleotide. 
Detection of two labels means that the individual is heterozygous. The heterozygous 
loci are recorded. As noted above, baseline determinations of heterozygosity may be 
done.using standard methods. Once a baseline is established, future tests on that 

20 individual are performed exploiting the heterozygous loci in order to detect a loss of 
heterozygosity. For the detection of cancer, the heterozygous loci are typically 
chromosomal areas containing tumor suppressor genes, including p53, dec, ape, and 
others. Using methods described herein, a "fingerprint" of heterozygous tumor 
suppressor loci may be constructed. Future deviation from the fingerprint (/.e. ( 

25 deletions) provides valuable information as to the development of cancer. 

A preferred use of the foregoing methods is in the detection of colon cancer. A 
representative stool sample is prepared as described above. Double-stranded DNA is 
converted to single-stranded DNA and complement of the strand to be detected is 
removed from the sample. The remaining single-stranded DNA is exposed to multiple 

30 copies of a probe designed on the basis of known single-base polymorphisms in a 
cancer-associated allele such that the probe hybridizes with a desired number of 
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nucleotides immediately adjacent the polymorphic nucleotide as described above. 
After hybridization is complete, the sample is washed and exposed to differentially- 
labeled ddNTPs and a DNA polymerase. The sample then is washed to remove 
unincorporated ddNTPs. The presence of any labeled ddNTPs is determined. If two 

5 labels are detected, the individual from whom the sample is obtained is heterozygous 
at the polymorphic nucleotide. The heterozygosity of the allele and the probe 
sequence matching the site immediately adjacent to the polymorphic allele are noted 
for reference in future testing for the loss of heterozygosity. Alternatively, once the 
patient is determined to be heterozygous at a locus, an assay may be performed 

10 immediately in the manner described above in order to determine an existing loss of 
heterozygosity in a subpopulation of cells in the sample. 

C. Analysis of Microsatellite Instability 

Microsatellites are di- or trinucleotide repeats found throughout the genome. A 
particular array of microsatellite repeats is often associated with a particular genomic 
15 sequence and is stably inherited under normal conditions. Expansions of microsatellite 
copy number typically, called "microsatellite instability," are associated with defects in 
mismatch repair. Accordingly, changes in a microsatellite region indicate that the 
patient is at risk for a mutation in other genomic regions. 

In order to detect microsatellite instability as an indicator of a mutation in a 
20 cancer-associated gene, one must first identify a microsatellite region associated with 
the gene of interest. Such regions are typically identified on a database, such as 
GenBank, EMBL, and others. Once a wild-type microsatellite region associated with, 
for example, the p53 tumor suppressor gene, is identified, an oligonucleotide probe is 
constructed that spans the microsatellite region and the regions immediately 5' and 
25 immediately 3' to the microsatellite region. The precise length of probes may be 
determined by the experimenter. , Probes are constructed that hybridize to the 
microsatellite region, including portions extending 5' and 3\ on both the maternal and 
paternal alleles with which the microsatellite is associated (e.g., p53). 

An appropriate sample of body tissue or fluid is obtained and processed as 
30 described herein. Double stranded DNA is denatured and an excess of maternal and 
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paternal probes, as described above, are introduced into the sample under 
hybridization conditions. The probes are detectably labeled as described above. 
Complement of the strands to be detected may optionally be removed by methods 
described above. The sample is then washed to remove unhybridized probe and the 
amount of hybridized probe in quantitatively detected. 

Quantitative detection may be accomplished by any means described herein. 
For example, probes may be attached to hybridization beads such that probes that bind 
to maternal allele are attached to beads of one size and probes that bind to paternal 
allele are attached to beads of a second size that is distinguishable from beads of the 
first size. Beads with attached probe may be counted as described above. 

The detection of a statistically-significant difference between the amount of 
probe binding to the maternal allele and the amount of probe binding to the paternal 
allele is indicative of microsatellite instability. As previously mentioned, microsatellite 
instability can be indicative of a mutation at the locus in which the microsatellite 
resides. If the microsatellite region is associated with a tumor suppressor gene or an 
oncogene, the detection of microsatellite instability in an allele in a subpopulation of 
cells in a biological sample is indicative of the potential for cancer or that cancer or 
precancer may have already developed. Further testing as described herein (either by 
invasive or noninvasive means) may then be conducted. 

In an alternative embodiment, a "fingerprint" of microsatellites is taken from 
regions associated with cancer-causing genes in a sample obtained from a patient. 
Such a fingerprint may be obtained by standard methods. The fingerprint comprises 
the sequence of wild-type microsatellites associated with the cancer-causing gene or 
genes. Once obtained, the fingerprint is stored and is used in future tests of samples 
from the same patient in order to monitor changes in microsatellite regions (i.e. 
microsatellite instability) that may be associated with the development of cancer. 
Changes in microsatellite length and/or sequence over time may be used to prescribe 
additional testing and/or treatment in order to detect and remove cancerous tissue at 
an early stage in its etiology. 

Additional embodiments of the invention are apparent upon consideration of the 
following claims. 
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. CLAIMS 

What is claimed is: 

1 1 . A method of screening for the presence of a disease in a patient, the method 

2 comprising the steps of: 

3 a) obtaining a sample comprising a circumferential surface of a stool 

4 voided by a patient; and 

5 b) performing an assay to detect in the sample characteristics 

6 indicative of the presence of said disease. 

1 2. A method for screening for the presence of a colorectal cancer or precancerous 

2 lesion in a patient, the method comprising the steps of: 

3 a) obtaining a stool voided by a patient; 

4 b) removing a circumferential surface from said stool; and 

5 c) performing an assay on said surface to detect characteristics 

6 indicative of the presence of cells or cellular debris shed onto said 

7 stool. 

1 3. The method of claim 2 t wherein said assay detects debris from a clonal 

2 population of transformed cells comprising said lesion. 

1 4. The method of claim 3, wherein said assay detects a protein expressed by the 

2 transformed cells indicative of the presence of said cells. 

1 5. The method of claim 3, wherein said assay detects a DNA characteristic 

2 indicative of the presence of said cells. 

1 6. The method of claim 1 , further comprising the step of homogenizing said portion 

2 in a physiologically compatible buffer prior to step (b). 

1 7. The method of claim 6, wherein said physiologically compatible buffer comprises 

2 a detergent and a proteinase. 

1 8. The method of claim 1 , wherein said assay detects the presence of 

2 carcinoembryonic antigen shed from said lesion. 
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1 9. The method of claim 1 , wherein said assay comprises the step of exposing said 

2 sample to an antibody which specifically binds a molecule characteristic of the 

3 presence of said debris. 

1 1 0. The method of claim 5, wherein said DNA characteristic is a mutation. 

1 11. The method of claim 1 0, wherein said mutation is selected from the group 

2 consisting of loss of heterozygosity and microsateliite instability. 

1 12. The method of claim 10, wherein said characteristics comprise a deletion in a 

2 tumor suppressor allele. 

1 1 3. The method of claim 1 , wherein said assay comprises the step of determining 

2 whether a difference exists in said sample between a number X of a first allele ■ 

3 known or suspected to be mutated in a subpopulation of cells in the sample and 

4 a number Y of a second allele that is known or suspected not to be mutated in a 

5 subpopulation of cells in the sample, the presence of a statistically-significant 

6 difference being indicative of a mutation in a subpopulation of cells in the 

7 sample and the potential presence of a cancerous or precancerous lesion. 

1 1 4. The method of claim 1 , wherein said assay comprises the step of determining 

2 whether a difference exists between a number of a target tumor suppressor 

3 allele in the sample and a number of a non-cancer-associated reference allele in 

4 the sample, the presence of a statistically-significant difference being indicative 

5 of a deletion of the target tumor suppressor allele in a subpopulation of cells in 

6 the sample and the potential presence of a cancerous or precancerous lesion. 

1 15. The method of claim 5, wherein said characteristics comprise a loss of 

2 heterozygosity encompassing a polymorphic locus. 

1 1 6. The method of claim 1 3, wherein said assay comprises the steps of: 

2 a) detecting an amount of a maternal allele at a polymorphic locus in 

3 the biological sample; 

4 b) detecting an amount of a paternal allele at the polymorphic locus in 

5 the biological sample; and 
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c) determining whether a difference exists between the amounts of 
maternal and paternal allele, 

the presence of a statistically-significant difference being indicative of a deletion 
at the polymorphic locus in a subpopulation of cells in the biological sample and 
the potential presence of a lesion. 

1 7. The method of claim 1 , comprising the additional step of performing a visual 
examination of the colon of a patient presenting positive assay results. 

18. The method of claim 1, wherein said at least cross-sectional portion comprises 
an entire stool voided by a patient. 

1 9. A method for reducing morbidity from colorectal cancer in a population, the 
method comprising the steps of: 

a) obtaining a sample comprising at least a representative portion of 
a stool voided by a patient; 

b) performing an assay to detect in the sample the presence of 
cellular debris shed from the lesion into the voided stool; and 

c) performing a visual examination of the colon of a patient presenting 
positive results in said assay to detect the presence of a lesion. 

20. The method of claim 1 9, wherein said assay detects a loss of DNA in a portion of 
debris shed from a clonal population of transformed epithelial cells comprising 
said lesion. 



1/4 




FIG. 1 



SUBSTITUTE SHEET (RULE26) 



2/4 




FIG. 2 



SUBSTITUTE SHEET (RULE26) 



3/4 




® ® V ® 

1 r i i J < ' 



3 
4 



0 ©® 
p « w 



\ 



® 0 @ @ 



FIG. 3 



SUBSTITUTE SHEET (RULE26) 



4/4 



3'ACGCTACGG5' 
.ATCGGCTTACTGCGATGCC....3 ' 



...TAGCCGAATGACGCTACGG....5 ! 
5'ATCGGCTTA3' 



3'ACGCTACGG5' 
... ATCGGCTTATTGCGATGCC. ...3 ' 



...TAGCCGAATAACGCTACGG....5 ' 
5 ' ATCGGCTTA3 ' 



FIG. 4 



SUBSTITUTE SHEET (RULE26) 



WO 99/53316 



SEQUENCE LISTING 

<110> Lapidus, Stanley N 
Shuber, Anthony p 
EXACT LABORATORIES 



<120> Methods for Disease Diagnosis From Stool Samples 
<130> EXT-015PC 



<140> 
<141> 

<150> US 09/059,713 
<151> 1998-04-13 

<160> 8 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: Hypothetical 



<400> 1 
ggcatcgca 

<210> 2 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: Hypothetical 

MZ 



<400> 2 

atcggcttac tgcgatgcc 

<210> 3 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: Hypothetical 



<400> 3 

ggcatcgcag taagccgat 

<210> 4 
<2U> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 

<400> 4 
atcggctta 

<210> 5 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 



WO 99/53316 



2 



<400> 5 
ggcatcgca 

<210> 6 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 
<400> 6 

atcggcttat tgcgatgcc 

<210> 7 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 
<400> 7 

ggcatcgcaa taagccgat 

<210> 8 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 

<400> 8 
atcggctta 



INTERNATIONAL SEARCH REPORT 



Intern- nal Application No 

PCT/US 99/08013 



A. CLASSIFICATION OF SUBJECT MATTER — 

IPC 6 G01N33/50 C12Q1/68 G01N33/72 G01N33/574 



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation Marched (classification system followed by classification symbols) 

IPC 6 G01N C12Q BOIL 



Documentation searched other than minimum documentation to the extent that such documents are Included In the fields searched 



Electron* data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category • Citation of document, wlh Indication, where appropriate, of the relevant passages 



Relevant to etalm No. 



X.L 



WO 97 28450 A (EXACT LABORATORIES) 
7 August 1997 (1997-08-07) 
the whole document 

EP 0 391 565 A (HELENA LABORATORIES 
CORPORATION) 10 October 1990 (1990-10-10) 
column 8 f line 15 - column 29; claims 

US 4 578 358 A (N.H.0KSMAN) 
25 March 1986 (1986-03-25) 

abstract; claims 

column 7, line 5 - line 11 



1-20 



1,18 



f~"j Further documents are listed In the continuation of box C. 



ID 



Patent family members are listed ii annex. 



' Special categorise of clad documents : 

•A' document defining the general state of the art which is not 
considered to be of particular relevance 

"E" earlier ctooiment but pubfished on or after the International 
fIRngdate 

IT document which may threw doubts on priorRy ctaim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

*0' document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior to the International ding date but 
later than the prionty date daimed 



T later dement published after the international filing date 
or priority date and not In cortflct with the application but 
cfledtoundersund the principle or theory urtdertyfng the 



"X" document ol parOctiar relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
Involve an Inventive step when the document Is taken alone 

"V document of particular relevance; the claimed hventton 
cannot be considered to Involve an Inventive step when the 
document is combined with one or more other such docuV 
mente, such combination being obvious to a person ski led 
m tne art 

•4" document member of the same patent famly 



Date of the actual completion of the international search 



17 September 1999 



Date of mating of the international search raport 

23/09/1999 



Name and mailing addreee of the ISA 

European Patent Office, P.a 5616 Patertiaan 2 

NL-2280HVRIJswi* 
TeL (+31-70) 340-2040. Tx. 31 651 epo rs, 
Fax (431-70) 340-3016 



Authorized officer 



Luzzatto, E 



Form FCT/ISAOio (seoond eheet) (July 1982) 



INTERNATIONAL SEARCH REPORT 

jrmttlon on patent family members 



Intarn* nal Application No 

PCT/uS 99/08013 



Patent document 
cited In search report 



Publication 
date 



Patent family 
member(s) 



Publication 
date 



Llrt 07 9 CMC ft 


A 

A 


07-08-1997 


1 If 

US 


CTJI 1 CPA A 

5741650 A 


21-04-1998 








AU 


704696 B 


29-04-1999 








AU 


2240697 A 

4VaW awW^f n 


VO 177/ 








CA 


2215263 A 


07-08-1997 








EP 


0817968 A 


14-01-1998 


EP 391565 


A 


10-10-1990 


US 


5192501 A 


09-03-1993 








CA 


2012830 A 


04-10-1990 








DE 


69026859 D 


13-06-1996 








DE 


69026859 T 


12-09-1996 








JP 


2295495 A 


06-12-1990 








US 


5196167 A 


23-03-1993 








US 


5217874 A 


08-06-1993 


US 4578358 


A 


25-03-1986 


AU 


547546 B 


24-10-1985 








AU 


2470984 A 


08-11-1984 








BR 


8400933 A 


05-03-1985 








CA 


1212891 A 


21-10-1986 








EP 


0124215 A 


07-11-1984 








JP 


1688475 C 


11-08-1992 








JP 


3054305 B 


19-08-1991 








JP 


59204766 A 


20-11-1984 








ZA 


8401079 A 


26-09-1984 



Form PCT/ISA*io (pita* tamly am* Uiy 1M2) 



PCT. 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

G01N 33/50, C12Q 1/68, G01N 33/72, 
33/574 



Al 



(11) International Publication Number: WO 99/53316 

(43) International Publication Date: 21 October 1999 (21 .10.99) 



(21) International Application Number: PCT/US99/08013 

(22) International Filing Date: 13 April 1999 (13.04.99) 



(30) Priority Data: 

09/059,713 



13 April 1998 (13.04.98) 



US 



(71) Applicant: EXACT LABORATORIES, INC. [US/US]; 63 

Great Road, Maynard, MA 01754 (US). 

(72) Inventors: LAPIDUS, Stanley, N.; 12 Old Evergreen Road, 

Bedford, NH 031 10 (US). SHUBER, Anthony, P.; 1 1 Grant 
Street, Milford. MA 01757 (US). 

(74) Agent: WALLER, Patrick, R., H.; Testa, Hurwitz & Thibeault, 
LLP, High Street Tower, 125 High Street, Boston, MA 
021 10 (US). 



(81) Designated States: AU, CA, JP, European patent (AT. BE, 
CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, 
NL, PT. SE). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: METHODS FOR DISEASE DIAGNOSIS FROM STOOL SAMPLES 
(57) Abstract 

The present invention provides methods for preparing a stool sample in order to screen for the presence of indicators of a disease, 
for example a subpopulation of cancerous or precancerous cells. The methods take advantage of the recognition that cellular debris from 
cancerous and precancerous cells is deposited onto only a longitudinal stripe of stool as the stool is forming in the colon. Accordingly, 
methods of the invention comprise obtaining a representative sample, such as a circumferential or cross-sectional sample of stool in order 
to ensure that any disease indicator, such as cellular debris that is shed by colonic cells, is obtained in the sample. 



FOR THE PURPOSES OF INFORMATION ONLY 



Code* used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


AM 


Arm roil 


FI 


AT 


Austria 


FR 


AU 


Australia 


GA 


AZ 


Azerbaijan 


GB 


BA 


Bosnia and Herzegovina 


GE 


BB 


Barbados 


GH 


BE 


Belgium 


GN 


BF 


Burkina Paso 


GR 


BG 


Bulgaria 


HU 


BJ 


Benin 


IE 


BR 


Brazil 


IL 


BY 


Belarus 


IS 


CA 


Canada 


IT 


CF 


Central African Republic 


JP 


CG 


Congo 


KE 


CH 


Switzerland 


KG 


CI 


Cote d'lvoire 


KP 


CM 


Cameroon 




CN 


China 


KR 


cu 


Cuba 


KZ 


cz 


Czech Republic 


LC 


DE 


Germany 


U 


DK 


Denmark 


LK 


EE 


Estonia 


LR 



Spain 
Finland 
Prance 
Gabon 

United Kingdom 
Georgia 
Ghana 
Guinea 

Hungary 

Ireland 

Israel 

Iceland 

Italy 

Japan 

Kenya 

Kyrgyzstan 

Democratic People's 

Republic of Korea 

Republic of Korea 

Kazakstan 

Saint Luda 

Liechtenstein 

Sri Lanka 

Liberia 



LS 
LT 
LU 
LV 
MC 
MD 
MG 
MK 

ML 

MN 

MR 

MW 

MX 

NE 

NL 

NO 

NZ 

PL 

PT 

RO 

RU 

SD 

SE 

SG 



Lesotho 



Luxembourg 
Latvia 



Republic of Moldova 

Madagascar 

The former Yugoslav 

Republic of Macedonia 

Mali 

Mongolia 

Mauritania 

Malawi 

Mexico 

Niger 

Netherlands 



New Zealand 
Poland 
Portugal 
Romania 

Russian Federation 



SI 


Slovenia 


SK 


Slovakia 


SN 


Senegal 


sz 


Swaziland 


TD 


Chad 


TG 


Togo 


TJ 


Tajikistan 


TM 


Turkmenistan 


TR 


Turkey 


XT 


Trinioad and Tobago 


UA 


Ukraine 


UG 


Uganda 


US 


United State* of America 


UZ 


Uzbekistan 


VN 


Viet Nam 


YU 


Yugoslavia 


zw 


Zimbabwe 



Swedes 
Singapore 



WO 99/5334* 



PCT/US99/08013 



METHODS FOR DISEASE DIAGNOSIS FROM STOOL SAMPLES 



This patent application is a continuation-in-part of U.S. Serial Number 
08/699,678, filed August 14, 1996, the disclosure of which is incorporated herein by 
reference. 

FIELD OF THE INVENTION 

5 This invention relates to methods for disease diagnosis, including the early 

detection of colon cancer in patients, and more particularly to methods for preparing 
stool samples for disease diagnosis, including the detection of colon cancer, so as to 
assure or increase the likelihood that the sample will contain the diagnostically relevant 
information if the patient has a disease, for example a cancerous or precancerous 
1 0 lesion, and to methods for stool sample analysis. 

BACKGROUND OF THE INVENTION 

Stool samples frequently must be prepared for medical diagnostic analysis. 
Stool samples may be analyzed to help diagnose medical conditions ranging from 
parasitic, bacterial or viral infections to inflammatory bowel disease and colorectal 
15 cancer. 

Colorectal cancer is a leading cause of death in Western society. However, if 
diagnosed early, it may be treated effectively by surgical removal of the cancerous 
tissue. Colorectal cancers originate in the colorectal epithelium and typically are not 
extensively vascularized (and therefore not invasive) during the early stages of 

20 development. Colorectal cancer is thought to result from the clonal expansion of a 
single mutant cell in the epithelial lining of the colon or rectum. The transition to a 
highly vascularized, invasive and ultimately metastatic cancer which spreads 
throughout the body commonly takes ten years or longer. If the cancer is detected 
prior to invasion, surgical removal of the cancerous tissue is an effective cure. 

25 However, colorectal cancer is often detected only upon manifestation of clinical 

symptoms, such as pain and black tarry stool. Generally, such symptoms are present 
only when the disease is well established, often after metastasis has occurred, and the 
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prognosis for the patient is poor, even after surgical resection of the cancerous tissue. 
Early detection of colorectal cancer therefore is important in that detection may 
significantly reduce its morbidity. 

Invasive diagnostic methods such as endoscopic examination allow for direct 
5 visual identification, removal, and biopsy of potentially cancerous growths such as 
polyps. Endoscopy is expensive, uncomfortable, inherently risky, and therefore not a 
practical tool for screening populations to identify those with colorectal cancer. Non- 
invasive analysis of stool samples for characteristics indicative of the presence of 
colorectal cancer or precancer is a preferred alternative for early diagnosis, but no 

10 known diagnostic method is available which reliably achieves this goal. 

Current non-invasive diagnostic methods involve assaying stool samples for the 
presence of fecal occult blood or for elevated levels of carcinoembryonic antigen, both 
of which are suggestive of the presence of colorectal cancer. Additionally, recent 
developments in molecular biology provide methods of great potential for detecting the 

15 presence of a range of DNA mutations or alterations associated with and indicative of 
the presence of colorectal cancer. The presence of such mutations theoretically can 
be detected in DNA found in stool samples during the early stages of colorectal cancer. 
However, stool comprises cells and cellular debris from the patient, from 
microorganisms, and from food, resulting in a heterogeneous population of cells. This 

20 makes detection of a small, specific subpopulation impossible to detect reliably. 

Stool diagnostic assays for colorectal cancer described in the art typically are 
performed on samples prepared from randomly sampled portions of voided stool. 
However, samples prepared according to such methods do not reproducibly yield 
characteristics indicative of the presence of colorectal cancer or precancer, even when 

25 prepared from stool voided by a patient with colorectal cancer or precancer. There is 
therefore a need in the art for methods for early diagnosis of colorectal cancer or 
precancer that will reproducibly detect characteristics indicative of the presence of 
cancerous or precancerous material in samples prepared from stool voided by a patient 
with colorectal cancer or precancer. Such methods are provided herein. 
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SUMMARY OF THE INVENTION 

It has now been appreciated that cells and cellular debris are shed from colonic 
epithelial cells onto forming stool in a longitudinal "stripe" of material along the length 
of the stool. The shed material is confined to this longitudinal stripe, as shown in 
5 Figure 1 (designated U C). Based upon this recognition, Applicants teach that stool 
sample preparation for diagnostic testing must include taking a representative sample 
in order to ensure that the sample will contain any cells or cellular debris that was shed 
into the stool as it passed through the colon. Accordingly, methods of the invention 
comprise obtaining at least a representative (e.g. a cross-section or circumferential 

10 surface) portion of stool voided by a patient, and performing an assay to detect in the 
sample the presence of cells or cellular debris shed from epithelial cells lining the colon 
that may be indicative of cancer or precancer. Most often, such cells will be derived 
from a polyp or a cancerous or precancerous lesion at a discrete location along the 
colon. For purposes of the present invention, a precancerous lesion comprises 

1 5 precancerous cells, and precancerous cells are cells that have a mutation that is 
associated with cancer and which renders such cells susceptible to becoming 
cancerous. As shown in Figure 1 , a cross-sectional sample is a sample that contains 
at least a circumferential surface of the stool (or portion of a stool comprising an entire 
cross-sectional portion), as, for example, in a coronal section or a sagittal section. A 

20 sample comprising the surface layer of a stool (or of a cross-section of a stool) also 
contains at least a circumferential surface of the stool. Both cross-sections and 
circumferential surfaces comprise longitudinal stripes of sloughed colonic epithelium, 
and are therefore representative samples. 

In a preferred embodiment, methods of the invention comprise the steps of 

25 obtaining at least a circumferential surface or cross-sectional portion of a stool voided 
by a patient, and performing an assay to detect debris indicative of disease. For 
example, such debris may comprise a clonal subpopulation of cells having one or more 
mutations (for purposes of the present application, a mutation is a deletion, 
substitution, addition, modification, intercalation or rearrangement of DNA). Preferred 

30 methods of the invention comprise detection of characteristics of such transformed 
cells, including, for example, mutations, proteins expressed uniquely or in altered 
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amounts in transformed cells, and blood. Particularly preferred methods of the 
invention comprise obtaining at least a circumferential surface or cross-sectional 
portion of a stool sample, and performing an assay to detect DNA characteristics 
indicative of the presence of a clonal subpopulation of cells in the sample. The clonal 
5 subpopulation may be, for example, a subpopulation of cancerous or precancerous 
cells, having a mutation in, for example, a p53 tumor suppressor gene. Clonal 
subpopulations of cells detected by methods according to the invention are often 
characterized by a massive loss of DNA, resulting in a loss of heterozygosity that 
renders ineffective the gene or genes encompassed by the deletion. Alternative 

1 0 methods of the invention comprise performing an assay to detect the presence of an 
infectious disease, for example debris from a microorganism. 

Methods of the invention also comprise obtaining a representative (/.e. ( cross- 
sectional or circumferential) sample of stool and homogenizing the stool in a buffer, 
such as a buffer comprising a detergent and a proteinase and optionally a DNase 

15 inhibitor. 

In methods according to the invention, an assay performed on at least a 
circumferential surface or cross-sectional portion of stool may be an assay to detect a 
disease when a diagnostic indicator of the disease is incorporated into stool. Methods 
of the invention are useful to detect cellular debris shed from the epithelial lining of the 

20 colon, and from other tissue sources that shed cellular material into the gastrointestinal 
tract and preferably into the colon. Shed material may be from colonic epithelial cells, 
pancreatic cells, or may be indicative of infection (e.g. bacterial or viral nucleic acids or 
proteins). In methods according to the invention, an assay performed on at least a 
circumferential surface or cross-sectional portion of stool may be an assay to detect the 

25 presence of elevated levels of carcinoembryonic antigen shed from cells lining the 
colon. Such an assay may also comprise detecting the presence of occult blood. 
However, methods of the invention preferably comprise an assay wherein the sample is 
exposed to an antibody that specifically binds to a molecule characteristic of cellular 
debris shed from cells comprising a subpopulation of cells having a mutation that is 

30 potentially associated with cancer. 
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Methods of the invention are especially and most preferably useful for detecting 
DNA characteristics indicative of a subpopulation of transformed cells in a 
representative stool sample. The DNA characteristics may be, for example, mutations, 
including loss of heterozygosity, microsatellite instability, and others. An assay for 
5 DNA characteristics in a method of the invention may comprise the step of determining 
whether a difference exists in a number X of a first allele known or suspected to be 
mutated in a subpopulation of cells in a representative stool sample, and a number Y of 
an allele known or suspected not to be mutated in the sample, a statistically-significant 
difference being indicative of a mutation and the possible presence of cancer in a 

10 subpopulation of cells in the sample. In an embodiment of the invention, the difference 
between a number of a tumor suppressor gene and a number of a non-cancer- 
associated gene are compared, a statistically-significant difference in the numbers 
being indicative of a mutation in the tumor suppressor gene. 

Assays useful in the practice of methods according to the invention also include 

1 5 an assay to detect the presence of a deletion or other mutation in a region 

encompassing a polymorphic nucleotide. In such an assay, a number of a polymorphic 
nucleotide present at maternal and paternal alleles, wherein the patient is 
heterozygous for the polymorphic nucleotide, is determined. A statistically significant 
difference between a number of a polymorphic nucleotide in a maternal allele and a 

20 paternal allele is indicative of the presence of a deletion in one of the two alleles. 
Another useful assay is one to detect an infectious disease. 

Methods of the invention typically include, following sample preparation and an 
assay for characteristics of cells or cellular debris, a visual examination of the colon in 
order to determine if a polyp or other lesion is, in fact, present. Finally, surgical 

25 resection of abnormal tissue may be done in order to prevent the spread of cancerous 
or precancerous tissue. 

Accordingly, methods of the invention provide means for screening for the 
presence of a cancerous or precancerous subpopulation of cells in a heterogeneous 
sample, such as a stool sample. Methods of the invention reduce morbidity and 

30 mortality associated with lesions of the colonic epithelium. Moreover, methods of the 
invention comprise more accurate screening methods than are currently available in 
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the art, because current methods take advantage of the observation that cancerous or 
precancerous cells shed debris only onto or into part of the surface of the forming stool. 
The present methods reliably assay over the entire circumference of the stool, thereby 
increasing the likelihood of detecting an abnormality if one exists. Further aspects and 
5 advantages of the invention are contained in the following detailed description thereof. 

DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagram of a cylinder which represents a formed stool and shows 
various cross-sections which will contain material from the entire circumference of a 
stool. The Section labeled "A" is a typical coronal section and the section labeled U B" is 
10 a typical sagittal section. The strip labeled U C represents material shed from 
cancerous tissue which is deposited in a longitudinal stripe. 

Figure 2 is a schematic diagram of a receptacle for containing a stool sample. 

Figure 3 is a schematic diagram of a multi-orifice impedance counter; wherein 
reference numeral 1 indicates the direction of flow through the column; reference 
15 numeral 2 indicates a plunger means for forcing material downward in the column; 
reference numerals 3 and 4 are different-sized hybridization beads; reference numeral 
5 is an optional filter for extracting unwanted particles; reference numeral 6 indicates 
an array of orifices for measuring differential impedance; and reference numeral 7 is a 
collection chamber. 

20 Figure 4 is a diagram showing primers useful for the detection of single base 

polymorphisms. 

DETAILED DESCRIPTION OF THE INVENTION 

Methods according to the present invention are useful for the preparation of 
stool samples that will reproducibly contain cells or cellular debris shed from a clonal 

25 population of cancerous or precancerous cells, if such a population is present at any 
site along the colon of a patient. These samples are then used to perform assays to 
detect characteristics indicative of cancer in a highly-reproducible and accurate way. 
In preferred methods of the invention, a sample representative of the surface of a stool 
is obtained. Such methods provide an improvement over the art inasmuch as they 

30 teach removing the portion of stool onto which the colonic epithelium sheds debris. 
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Two preferred methods for obtaining such a portion are cross-sectioning and extracting 
the surface of the stool. In a more preferred method, the surface of a cross-section is 
obtained. Without the recognition that at least a cross-sectional sample must be 
obtained, there is no means for reproducibly obtaining a sample that will contain a 
5 cancerous or precancerous subpopulation of cells, if one exists. 

Methods described in the art do not recognize that, unlike infection by parasites, 
bacteria and viruses, characteristics indicative of the presence of colon cancer, 
especially early stage colon cancer, are found only in a specific portion of voided stool. 
If the sampled portion of stool does not include the portion that happens to contain 

1 0 cells and cellular debris shed from early-stage cancer tissue, the diagnostic assay 
necessarily will fail to detect the characteristics indicative of the presence of colorectal 
cancer in a reliable manner even if homogenized, i.e., will produce a false-negative 
result. Methods of the invention are also useful to detect the presence of bacteria, 
virus or parasites, as they will also be present on the surface of formed stool. 

1 5 Sloughed cells from, for example, a polyp forming in the epithelial lining of the 

colon, or on early stage cancerous lesions, are sloughed onto only the portion of the 
forming stool that comes into contact with the polyp or lesion. Accordingly, in early 
stage disease, only a small portion of the surface layer of the forming stool will contain 
sloughed cells, and if that portion happens not to be taken as part of the sample, an 

20 assay for indicia of colon cancer necessarily will produce a false-negative result. A 
brief review of the anatomy and physiology of the colon will aid in an understanding of 
this phenomenon. 

A typical adult colon is approximately six feet in length, with a diameter of about 
two to three inches. Numerous bends and folds are present throughout its length. The 

25 colon removes water from liquid or semi-liquid waste material that enters the colon, and 
relatively solid stool begins to form in the proximal third of the colon. Epithelial cells 
line the lumen of the colon, and the lumenal surface is organized into microscopic 
crypts. Colorectal epithelial cells are replaced every four to five days. The epithelial 
cells divide rapidly at the base of the crypts and migrate to the apeces, where cells 

30 appear to undergo apoptosis (programmed cell death), and cellular debris is shed into 
the lumen. The lining of the colorectal lumen is elastic and the diameter of the lumen is 
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determined by the volume of stool that is passing through the colon at any given time. 
As a result, the surface of the forming stool passing through the colon is in direct 
contact with the epithelial lining of the lumen. Shed epithelial cells (which may or may 
not have undergone apoptosis) and cellular debris therefore are incorporated onto the 
surface of stool as it passes through the colon. 

Cells and cellular debris from colorectal epithelial cancers therefore also are 
shed onto forming stool. Most colorectal cancers develop in regions of the colon where 
stool is relatively solid, indeed approximately one third of such cancers develop in the 
rectum. Markers indicative of the presence of cancer, including cells, cellular debris, 
DNA, blood, and carcinoembryonic antigen, are shed onto the portion of the forming 
stool that contacts the cancerous tissue as the stool passes through the colon. Since 
the stool is relatively solid, these markers will remain on or near the surface of the stool 
where they were deposited and will not be homogeneously dispersed throughout the 
stool. As stool passes over a cancerous or precancerous growth, material from the 
growth will be deposited along the stool, but only on the part of the stool circumference 
that comes into direct contact with the cancerous or precancerous tissue comprising 
the lesion. Stool voided by a patient with colorectal cancer or precancer is therefore 
characterized by a longitudinal "stripe" of diagnostically relevant material derived from 
the cancerous or precancerous tissue. 

A sample that does not include material from the entire circumference of a stool 
voided by a patient with colorectal cancer or precancer will not reproducibly contain 
material derived from the cancerous or precancerous tissue. Currently, random, non- 
cross-sectional samples ("smears") of voided stool are analyzed in clinical settings. In 
these, sloughed cancerous or precancerous cells and cellular debris have no 
possibility of detection unless the sample happens by chance to contain the portion of 
stool which made contact with the region of the colon from which cells were sloughed. 

Furthermore, cancers typically develop by clonal expansion of a single mutant 
cell, and in the early stages of the disease, i.e., when surgical removal is an effective 
cure, the cancerous lesion will be very small and may lie on a small arc of the 
circumference of the colon. Material derived from such an early stage cancer therefore 
will be shed onto or into stool in a very narrow stripe (labeled C in Figure 1). 
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Consequently, a sample that does not contain the entire circumference of a stool 
voided by a patient with early stage colorectal cancer or precancer only by chance will 
contain material indicative of the presence of the early stage cancerous or 
precancerous condition. However, early detection of colorectal cancer is very 
5 important for effective surgical intervention. The present invention provides methods 
for reproducible early detection of characteristics indicative of the presence of cancer 
or precancer in a patient. 

Similarly, the present invention provides methods for reproducibly detecting an 
indicator of the presence of an infectious agent, such as a virus, bacterium, parasite or 
1 0 other microorganism, in stool from an infected patient. Methods of the invention are 
particularly useful for reproducibly detecting an infectious agent, or an indicator 
characteristic thereof, that is deposited onto narrow area of stool as it forms in the 
colon. 

Analysis of at least a circumferential surface or cross-sectional sample of stool 

1 5 ensures that a disease indicator, incorporated on a stool as it moves through the colon, 
will be present in a diagnostic assay sample. For example, analysis of at least a 
circumferential surface or cross-sectional sample of stool (including a whole stool), as 
shown in Figure 1 , ensures that at least a portion of cells and cellular debris shed from 
any existing cancerous or pre-cancerous cells (even if shed from small early stage 

20 cancerous or pre-cancerous tissue, e.g., small polyps) will be present in the portion of 
the stool sample to be analyzed. Indeed taking at least a circumferential surface or 
cross-section of the stool sample avoids the possibility of analyzing stool portions that 
will not contain sloughed cancerous or precancerous cells even when the patient has 
colorectal cancer or precancer. 

25 Once a circumferential surface or cross-sectional sample of stool is obtained, it 

may be homogenized by known methods to distribute cells and cellular debris 
throughout the sample. An assay then is performed on the homogenate, or an extract 
of the homogenate, to detect the presence of cells and/or cellular debris in the sample. 
The assay may be any one or a combination of histological cellular assays, antibody 

30 based immunoassays (or other formats) designed to detect the presence of a molecule 
characteristic of transformation such as a protein, or DNA-based assays for detecting 
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mutations or genetic characteristics indicative of colorectal cancer. Known assay 
protocols, those disclosed herein or in U.S. Patent No. 5,670,325, or assays hereafter 
developed may be used in the practice of the invention. Non-limiting examples of 
useful known assay protocols include those disclosed in U.S. Patent Nos. 5,137,806 
5 (detection of sequences in selected DNA molecules), 5,348,855 (assay for nucleic acid 
sequences), 5,512,441 (detection of mutant alleles), 5,272,057 and 5,380,645 (RFLP 
analysis), 5,527,676 (detection of p53 gene sequences), 5 t 330,892 (detection of MCC 
gene sequences), 5,352,775 (detection of APC gene sequences), 5,532,108 (detection 
of DCC gene sequences), and in WO96/08514 (monoclonal antibodies against human 

1 0 colon carcinoma-associated antigens), the disclosures of each of which are 

incorporated by reference herein. Alternatively, or additionally, an assay for fecal 
occult blood may be performed as reported in U.S. Patent Nos. 4,333,734 and 
5,196,167, incorporated by reference herein. Assays useful in the context of the 
present invention also include an assay for carcinoembryonic antigen as reported in 

15 U.S. Patent No. 5,380,647, incorporated by reference herein. In addition, assays 

described in USSN 08/808,763; 08/815,576; 08/876,857; 08/876,638; 08/877,333; and 
60/063,219; incorporated by reference herein, are also useful in the context of the 
present invention. Finally, the sample may be prepared, as reported in U.S. Patent No. 
4,857,300, incorporated by reference herein, for histological examination to detect 

20 characteristics indicative of the presence of cancerous or precancerous cells. 

The purpose of any assay protocol used in connection with obtaining at least a 
cross-sectional sample is to identify candidates for subsequent invasive diagnostic 
procedure such as colonoscopy or sigmoidoscopy. The assay accordingly need not 
definitively detect the presence of a cancerous or precancerous lesion, although false 

25 negatives obviously are to be avoided. The goal of the test protocol is not to determine 
whether in the vast quantities of cell debris in the sample there are a few cells which 
bear a mutation commonly associated with early stage transformation, but rather 
whether the sample contains debris indicative of clonal expansion of a mutant cellular 
subpopulation. Maximum benefit will come from an assay designed to detect the likely 

30 presence of a clonally expanded cell population, i.e., of transformed colonic epithelial 
cells which comprise the cancerous or precancerous lesions. Assays having the ability 
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to detect cancer or precancer in early stages are preferred in methods of the invention. 
Assays using polymerase chain reaction (PCR), restriction fragment length 
polymorphism (RFLP) or other methods for nucleic acid analysis may be used to detect 
known DNA characteristics indicative of the presence of colorectal cancer or 
5 precancer. More precise methods for quantitative detection of cellular debris, such as 
DNA fragments or segments, may be used to analyze cross-sectional samples 
according to methods described herein. 

A preferred assay interrogates the sample for DNA characteristics indicative of 
the development of cancer or precancer. However, assays for use with methods of the 
10 invention may detect any abnormal cellular debris shed from clinically-relevant 

transformed tissue. Thus, in accordance with a preferred aspect of the invention, an 
assay is used to detect the presence of characteristics of cells which have experienced 
loss of heterozygosity, microsatellite instability or other mutation. 

The following examples provide details of methods according to the invention. 
15 However, numerous additional aspects of the invention, especially in terms of assays to 
be performed, will become apparent upon consideration of the following detailed 
description thereof. 

Example 1 

Preparation of a Stool Sample 

20 A sample is prepared such that it contains at least a circumferential surface or 

cross-sectional portion of a stool voided by a patient. 

Stool is voided into a receptacle that is preferably small enough to be 
transported to a testing facility. The receptacle may be fitted to a conventional toilet 
such that the receptacle accepts stool voided in a conventional manner. The 

25 receptacle may comprise a mesh or screen of sufficient size and placement such that 
stool is retained while urine is allowed to pass through the mesh or screen and into the 
toilet. The receptacle additionally may comprise means for removing a circumferential 
surface or cross-sectional portion from the stool. Moreover, the receptacle may 
comprise means for introducing homogenization buffer or one or more preservatives, 

30 such as alcohol, a solution of high salt concentration, antibiotics, and chaotropic salts 
in order to neutralize bacteria present in the stool sample. The homogenization buffer 
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may be a physiologically compatible buffer such as phosphate buffered saline, and may 
comprise salt such as 20-100 mM NaCI or KCI. The homogenization buffer may also 
comprise a detergent, such as 1-10% SDS or triton, and/or a proteinase, such as 
proteinase K. The buffer may also contain inhibitors of DNA and RNA degrading 
5 enzymes. 

The receptacle, whether adapted to fit a toilet or simply adapted for receiving the 
voided stool sample, should include sealing means sufficient to contain the voided 
stool sample and any solution added thereto and to prevent the emanation of odors. 
An exemplary receptacle is shown in Figure 2. A shown in that figure, the receptacle 

10 has a support frame 1 which is placed directly over the toilet bowl 2. The support frame 
1 has attached thereto an articulating cover 3 which may be placed in a raised position, 
as shown in figure 2, for depositing of sample or a closed position (not shown) for 
sealing voided stool within the receptacle. The support frame 1 additionally has a 
central opening 4 traversing from a top surface 5 through to a bottom surface 6 of the 

1 5 support frame 1 . The bottom surface 6 directly communicates with a top surface 7 of 
the toilet 2. Extending from the bottom surface 6 of the support frame 1 is a means 8 
for capturing voided stool. Means 8 may be fixedly attached to the support frame 1 or 
may be removably attached for removal subsequent to deposition of stool. Means 8 
may comprise a further means for removing at least a circumferential surface or cross- 

20 sectional portion from the voided stool. A preferable sample size is at least 5-1 0 g or at 
least 5-10 ml. A means to assess the presence of a minimal sample size may comprise 
a physical diagram indicating the minimal sample size. Alternatively a means to assess 
the presence of a minimal sample size may comprise the displacement of a liquid or of 
a mechanical device to a minimal level upon deposit of the stool sample. 

25 In a preferred embodiment, a cross-sectional portion is removed from the voided 

stool by making one or more sagittal or coronal section through the stool, as shown in 
Figure 1. The removed portion comprises material from an entire circumferential 
surface of the stool. Alternatively, a circumferential surface of a stool is prepared by 
taking the entire surface layer of the stool, or the entire surface layer of a cross- 

30 sectional portion of the stool. In a preferred embodiment, the surface layer is 

approximately 0.001 to 0.1 mm thick. In another preferred embodiment, the surface 
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layer is approximately 0.1 to 20 mm thick. In a more preferred embodiment, the surface 
layer is approximately 5 mm thick. In a most preferred embodiment, the surface layer is 
representative of material deposited on the forming stool, and reproducibly contains 
markers or indicators, such as shed cellular material, of a disease that is present in the 
5 patient. In one embodiment, a stool surface layer is obtained by cutting or scraping off 
the surface of the stool or cross-sectional portion of the stool. In a preferred 
embodiment, a stool surface layer is prepared using a cylindrical cutting device (for 
example a rigid loop) that has a diameter that is smaller than the diameter of the stool 
or cross-sectional portion of the stool. The surface layer is obtained when the stool 

10 sample is forced through the cutting device. In a preferred embodiment, the diameter 
of the cutting device is adjustable. Alternatively, a circumferential surface of a stool is 
obtained by contacting the surface of a stool or cross-sectional portion of a stool with a 
sampling device. In one embodiment, the sampling device is first wrapped around the 
stool sample and subsequently removed or peeled off. The layer of stool that is peeled 

15 off with the sampling device is then processed as described herein. Alternatively, the 
stool sample is rolled along the surface of the sampling device, and the layer of stool 
that adheres to the surface is then processed as described herein. Alternatively, a 
diagnostic assay is performed directly on the layer of material on the sampling device. 
It is important that the entire circumference of the stool is sampled. In a preferred 

20 embodiment, the sampling device has a surface that stool adheres to. In a more 
preferred embodiment, the sampling device surface preferentially binds diagnostic 
markers or indicators of a disease (for example nucleic acid or protein molecules), and 
thereby removes them from the surface of the stool sample. In one embodiment, the 
sampling device is an adherent paper or membrane. In a preferred embodiment, a 

25 stool is frozen prior to processing. In one embodiment the stool is frozen to between -5 
and -30 °C. In a preferred embodiment the cross-sectional portion or circumference of 
the stool is obtained from the frozen or partially thawed stool. In an alternative 
embodiment, a whole stool may be used for diagnostic assays. The stool or portion of 
stool that is obtained contains sufficient material to allow subsequent diagnostic assays 

30 to be performed. 
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Once obtained, the entire circumference or cross-sectional sample of stool is 
homogenized in an appropriate buffer, such as phosphate buffered saline. 
Homogenization means and materials for homogenization are generally known in the 
art. Thus, particular homogenization methods may be selected by the skilled artisan 
5 and may depend upon the assay to be employed. The buffer may contain detergent, 
salt, proteinase, inhibitors of DNA and RNA degrading enzymes. The composition of 
the buffer will depend on the type of assay to be performed. If a fecal occult blood 
assay is to be performed, the buffer may contain chemical compounds which react with 
blood to produce a color, the intensity of which can be measured. Buffers useful for 
1 0 detecting the presence of fecal occult blood are known in the art. If a test is to be 
performed for the presence of a particular protein, the buffer should not contain a 
proteinase capable of degrading such tumor marking antigens. 

DNA or RNA may be isolated from the homogenate using methods known in the 
art. Subsequent tests may be performed on the isolated DNA and RNA. 

15 Example 2 

Exemplary Enumerative methods for Detection of Colorectal Cancer or 
Precancer in stool samples 

DNA characteristics associated with the presence of colorectal cancer or a 
precancerous lesion may be detected in stool samples prepared according to the 
20 invention, using, for example, the methods described in the following sections. A 
careful endoscopic examination preferably is performed on positive Individuals, 
followed by early surgical excision of any diseased tissue. 

A. Reference-Target 

Methods of the invention are used to prepare a stool sample followed by 
25 detection of a deletion or other mutation in the p53 tumor suppressor gene. The p53 
gene is a good choice because a loss of heterozygosity in p53 is often associated with 
colorectal cancer. An mRNA sequence corresponding to the DNA coding region for 
p53 is reported as GenBank Accession No. M92424. At least a cross-section of a 
voided stool sample is obtained and prepared according to methods of the invention as 
30 described immediately above. The sample need not be further processed for analysis. 
However, DNA or RNA may optionally be isolated from the sample according to 
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methods known in the art. See, Smith-Ravin, et a/., Gut, 36: 81-86 (1995), 
incorporated by reference herein. 

Nucleic acids may be sheared or cut into small fragments by, for example, 
restriction digestion. The size of nucleic acid fragments produced is not critical, subject 
5 to the limitations described below. A target allele that is suspected of being mutated 
(p53 in this example) and a reference allele are chosen. A reference allele may be any 
allele known normally not to be mutated in colon cancer. 

Either portions of a coding strand or its complement may be detected. For 
exemplification, detection of the coding strand of p53 and reference allele are 

10 described herein. Complement to both p53 and reference allele are removed by 
hybridization to anti-complement oligonucleotide probes (isolation probes) and 
subsequent removal of duplex formed thereby. Methods for removal of complement 
strands from a mixture of single-stranded oligonucleotides are known and include 
techniques such as affinity chromatography. Upon converting double-stranded DNA to 

15 single-stranded DNA [See, e.g., Sambrook, et a/., Molecular Cloning, A Laboratory 
Manual (1989) incorporated by reference herein], sample is passed through an affinity 
column packed with bound isolation probe that is complementary to the sequence to be 
isolated away from the sample. Conventional column chromatography is appropriate 
for isolation of complement. An affinity column packed with sepharose or other 

20 appropriate materials with attached complementary nucleotides may be used to isolate 
complement DNA in the column, while allowing DNA to be analyzed to pass through the 
column. See Sambrook, Supra. As an alternative, isolation beads may be used to 
exclude complement as discussed in detail below. 

After removal of complement strands, first oligonucleotide probes which 

25 hybridize to at least a portion of the p53 allele and second oligonucleotide probes that 
hybridize to at least a portion of the reference allele are obtained. The probes are 
labeled with a detectable label, such as fluorescein or with detectable particles. 
Distinct labels for the probes are preferred. However, identical labels may be used if, 
for example, sample is assayed in two separate aliquots. Probes may be labeled with 

30 identical or with distinct labels. However, distinct labels are preferred. 
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Labeled probes then are exposed to sample under hybridization conditions. 
Such conditions are well-known in the art. See, e.g., Wallace, et al., Nucleic Acids 
Res., 6:3543-3557 (1979), incorporated by reference herein. First and Second 
oligonucleotide probes that are distinctly labeled (i.e. with different radioactive 
5 isotopes, fluorescent means, or with beads of different size, See infra) are applied to a 
single aliquot of sample. After exposure of the probes to sample under hybridization 
conditions, sample is washed to remove any unhybridized probe. Thereafter, 
hybridized probes are detected separately for p53 hybrids and reference allele hybrids. 
Standards may be used to establish background and to equilibrate results. Also, if 

1 0 differential fluorescent labels are used, the number of probes may be determined by 
counting differential fluorescent events in a sample that has been diluted sufficiently to 
enable detection of single fluorescent events in the sample. Duplicate samples may be 
analyzed in order to confirm the accuracy of results obtained. 

If there is a statistically-significant difference between the amount of p53 

1 5 detected and the amount of the reference allele detected, it may be assumed that a 
mutation has occurred in p53 so as to alter its sequence and prevent hybridization of 
the probe, or that at least a portion of the region of the genome containing p53 has 
been lost in a subpopulation of cells shed from the colon. The patient therefore may be 
at risk for developing or may have developed colon cancer. Statistical significance may 

20 be determined by any known method. See, e.g., Steel, et al., Principles and 

Procedures of Statistics: A Biometrical Approach (McGraw, Hill, 1980). Statistical 
methods are also outlined in co-owned U.S. Patent No. 5,670,325. 

The determination of a p53 mutation allows a clinician to recommend further 
treatment, such as endoscopy procedures, in order to further diagnose and, if 

25 necessary, treat the patient's condition. The following examples illustrate methods that 
allow direct quantification of hybridization events. 

1. Method for Quantitation of Target and Reference 
Polynucleotides 

Enhanced quantification of binding events between hybridization probes and 
30 target or reference is accomplished by coupling hybridization probes to particles, such 
as beads (hybridization beads). In order to obtain a precise quantitative measure of 
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the amount of a polynucleotide in a sample, hybridization beads are constructed such 
that each bead has attached thereto a single oligonucleotide probe. 

a. Method for Preparation of Probe-Bead Combinations 
A single probe is attached to a bead by incubating a large excess of 
5 hybridization beads with oligonucleotide probes of a given type (/.e., either first or 
second oligonucleotide probes). Coupling of probe to bead is accomplished using an 
affinity-binding pair. For example, beads may be coated with avidin or streptavidin and 
probes may be labeled with biotin to effect attachment of the probe to the bead. The 
mixture of beads and probes is agitated such that 100% of the probes are bound to a 

10 bead. The mixture is then exposed to a matrix, such as an affinity column or a 

membrane coated with oligonucleotides that are complementary to the probe. Only 
beads that have an attached probe will adhere to the matrix, the rest being washed 
away. Beads with coupled probe are then released from the matrix by melting 
hybridizations between probe and complement. Multiple exposures to the matrix and 

1 5 pre-washing of the column reduces non-specific binding. Moreover, naked beads (/.&, 
without attached probe) may be exposed to the matrix to determine a background 
number of beads that can be expected to attach to the matrix in the absence of probe. 

By using a vast excess of beads relative to probe as described above, the vast 
majority of recovered beads will have only one attached probe. For example, if a 

20 mixture has a ratio of 1 probe to 1000 beads, it is expected that only about 1 bead in a 
million will have two attached probes and even less than one bead in a million will have 
more than two attached probes. Accordingly, hybridization beads are provided in an 
effective 1:1 ratio with probe which allows for precise quantification of target and 
reference polynucleotide as described below. 

25 For each assay described below, two distinct hybridization beads are used. A 

first hybridization bead has attached thereto a single first oligonucleotide probe that is 
complementary to at least a portion of a target polynucleotide (e.g., a p53 allele). A 
second hybridization bead, of a size distinct from the first hybridization bead, has 
attached thereto a single second oligonucleotide probe that is complementary to at 

30 least a portion of a reference polynucleotide (/.e., one that is known or suspected not 
to be mutated in the sample). 
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b. Use of Beads to Quantify Target and Reference 
Polynucleotides 

5 DNA is melted (denatured to form single-stranded DNA) by well-known methods 

See, e.g., Gyllensten, ef a/. t in Recombinant DNA Methodology //, 565-578 (Wu f ed., 
1995), incorporated by reference herein. One may detect either a coding strand or its 
complement in order to quantify target and/or reference polynucleotide. For purposes 
of illustration, the present example assumes detection of the coding strand. 

10 

2. Removal of Complement 
Single-stranded complement of the target polynucleotide (e.g., p53) and 
reference polynucleotide are removed from the sample by binding to oligonucleotide 
probes that are complementary to target or reference complement. Such probes, 

15 referred to herein as isolation probes, are attached to isolation beads prior to their 
introduction into the sample. The beads may be magnetized. Thus, when magnetized 
isolation beads [with attached isolation probe(s)] are introduced into the sample, the 
attached isolation probes hybridize to complement of target or reference (or vice 
versa). Isolation beads preferably are introduced in vast excess in order to saturate 

20 complement binding. Once hybridization is complete, a magnetic field is applied to the 
sample to draw the magnetized isolation beads (both with and without hybridized 
complement) out of the sample. Assuming that a sufficient quantity of isolation beads 
are introduced into the sample, removal of the isolation beads effectively removes all 
target and reference complement from the sample. 

25 In an alternative method for complement removal, an excess of oligonucleotide 

probe labeled with biotin is exposed to the melted or dehybridized (single stranded) 
sample under hybridization conditions. Once hybridization is complete, the sample is 
exposed to a column containing immobilized avidin. The biotin-labeled probe, whether 
free or hybridized to complement, is bound by avidin on the column. The remainder of 

30 the DNA, including target and reference coding strands to be detected, pass through 
the column. In contrast to the description of hybridization beads above, beads for 
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removal of complement may each comprise multiple complementary oligonucleotide 
probes. 

3. Quantitation of Target and Reference 
Two sets of hybridization beads are prepared as described above. Each 
5 member of a first set of hybridization beads (all of which are identical to each other) 
has attached thereto a single oligonucleotide probe that is complementary to at least a 
portion of the target polynucleotide, i.e., the portion of the genome which is altered in 
the cells of a cancerous lesion. Each member of a second set of identical hybridization 
beads (all of which are identical to each other but not to the first set) has attached 

10 thereto a single oligonucleotide probe that is complementary to at least a portion of the 
reference polynucleotide, i.e., a portion of the genome which is not likely to be altered ' 
in malignant cells. Members of the second set of hybridization beads are of a size or 
color distinct from that of members of the first set of hybridization beads. First and 
second hybridization beads may also be distinguished on the basis of other 

1 5 characteristics. For example, beads may have fluorescent markers that are 

distinguished by their fluorescence wavelength. Beads with distinct electrochemical 
charges also may be used. The precise modality used for distinguishing beads is not 
essential as long as it is possible to distinguish between first and second probe on the 
basis of distinctions between attached first and second beads. 

20 Both sets of hybridization beads are exposed to the sample under hybridization 

conditions thereby allowing hybridization to reference and target. The sample then is 
washed to remove unhybridized bead/probe combinations. Unhybridized bead/probe 
combinations are removed by, for example, passing the sample through a column lined 
of immobilized DNA complementary to the probe sequence. Thus, any unhybridized 

25 bead/probe combinations are retained on the column while duplex passes through. 
Subsequently, the sample is exposed to means for differentially counting hybridization 
beads in order to quantify first and second hybridization probes which have formed 
duplexes. The numbers obtained provide a precise estimate of the number of copies of 
the reference and target polynucleotide in the population because differential counting 

30 means count individual beads. One bead is equal to one probe which, in turn, signifies 
one copy of the nucleic acid being measured. 
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An example of a differential counting means is an impedance measuring device, 
such as a Coulter counter (Coulter Electronics, Inc., Miami, Florida). Sample is passed 
through the device which differentially detects the two types of hybridization beads by 
measuring their differential impedance of an electric current. Alternatively, the device 
may measure fluorescence, color, or other parameters. In order to increase the speed 
of the assay, a multi-orifice device may be used. A multi-orifice impedance counter is 
shown schematically in figure 2. A multi-orifice array is placed at one end of a column 
filled with an electrically-conductive fluid, such as saline. Hybridization beads with 
either hybridized target or reference segments are inserted at an opposite end of the 
column. Each orifice is large enough to accommodate only one hybridization bead at a 
time and sufficiently wide to allow reliable impedance measurements. A voltage is set 
across each orifice. Each hybridization bead (which is non-conducting), as it passes 
through one of the orifices, displaces a volume of saline, thus creating an impedance 
that is proportional to its size. This, in turn, creates a measurable decrease in current 
that is directly correlated with the size of the bead. By compiling the number of each of 
the two distinct impedance events, a precise estimate of the number of hybridization 
beads and, therefore, the number of probes of each type in the population may be 
obtained. 

. Upon quantitative measurement of first and second hybridization beads, the data 
may be analyzed to determine whether any difference between the amounts of first and 
second hybridization beads is statistically significant. A reduction in the amount of 
target relative to the reference is indicative of a mutation in or deletion of the target 
allele in a subpopulation of cells in the sample. Where the p53 gene is the target 
allele, such a mutation is indicative of a cancerous or precancerous condition. A 
clinician may use such results as a basis for prescribing additional treatment, such as 
endoscopy and polypectomy procedures. 

B. Detection of Mutations in Single-base Polymorphisms 

The basic method described above may also be applied to detect a loss of 
heterozygosity or other mutation at a single base polymorphic site between maternal 
and paternal alleles. Such detection is typically an indication of a larger deletion or 
other mutation. However, a mutation at a single polymorphic nucleotide may be all that 
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is necessary to inhibit gene function in one of the two alleles. A mutation in a single- 
base polymorphic region may be difficult to detect due to a recently-discovered 
phenomenon called complementary reduplication. In complementary reduplication, the 
loss of one of two alleles at a particular locus results in "reduplication" of the surviving 
5 allele. Reduplication usually takes place on the chromosome containing the surviving 
allele and involves the production of one or more copies of the surviving allele in close 
proximity on the chromosome to the position of the surviving allele. In the case of a 
locus that displays one or more single-base allelic polymorphisms (/.a, heterozygosity 
at the locus is determined by virtue of one or more single-base differences in one or 

10 more regions of the locus), complementary reduplication results in the insertion on the 
chromosome containing the surviving allele of a duplicate of the sequence 
corresponding to that which was deleted. Even under the most stringent hybridization 
conditions, some of a probe directed against the deleted sequence will bind to the 
reduplicated sequence at a locus of a single-base polymorphism. Accordingly, in such 

15 circumstances, the deletion may not be detected because any true difference in the 
number of probes binding to the polymorphic site (/.e., the allelic region encompassing 
the single-base polymorphism) may be obscured by an increase resulting from the 
other allele's reduplicated region. 

The problems associated with complementary reduplication, and with non- 
20 specific probe binding generally, are alleviated by the practice of the methods 

described herein. Such methods allow detection of a deletion in one of two alleles 
present at a specific locus in a subpopulation of cells contained in a biological sample. 
Numerous alleles, including tumor suppressor alleles, contain single polymorphic 
nucleotides in the context of a constant nucleic acid region. Individuals normally may 
25 be either homozygous or heterozygous for the polymorphic nucleotide. Since 

numerous single-base polymorphic nucleotide sites exist in most alleles, the probability 
that a given individual is heterozygous at least one of the single-base polymorphism 
sites is high. A statistically-significant reduction in one of the two nucleotides at a 
single-base polymorphic site (at which the individual is heterozygous) may be used as 
30 a marker for a deletion in the allele encompassing that site. 
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Genomic regions containing known single-base polymorphisms may be 
identified by reference to a nucleotide database, such as GenBank, EMBL, or any other 
appropriate database. The existence of polymorphisms may be determined by 
methods taught herein, gel electrophoresis or by other standard methods. For 
5 purposes of the invention, a single-base polymorphism is intended to be a single 

polymorphic nucleotide adjacent to a non-polymorphic region of the allele regardless of 
whether the single polymorphic nucleotide forms part of a larger polymorphic site (i.e. 
toe single-base polymorphism may be the terminal nucleotide of a larger, 
polynucleotide polymorphism). For cancer detection, the regions considered are 
10 regions in which loss of heterozygosity is prevalent, such as regions containing tumor 
suppressor genes. A given individual may be homozygous or heterozygous for the 
polymorphic nucleotide in any identified single-base polymorphic region. Accordingly, 
if a number of single-base polymorphic regions are identified, the probability increases 
that at least one heterozygous single-base polymorphic region is found in a sample. 

15 Once single-base polymorphic sites are identified, a DNA sample is obtained 

from a patient, e.g., from blood cells, to determine which of those sites is heterozygous 
in normal (/.e M non-cancerous or non pre-cancerous) cells for that individual. Then, a 
stool sample is prepared as described above. Double stranded DNA in the sample is 
converted to single-stranded DNA. Then, either the coding strand or the anti-coding 

20 strand for both alleles is removed from the sample. As will be evident from the 

following discussion, methods disclosed herein are indifferent as to whether coding 
strand or anti-coding strand is tested. 

An oligonucleotide probe is constructed that is complementary to a portion of the 
region of single-base polymorphism, said portion ending at the nucleotide that is 

25 immediately 3' to the polymorphic nucleotide, regardless of whether the 5'-3' (coding) 
strand or the 3-5' (anticoding) strand is used as a template. Figure 3 shows four 
possible probes that are immediately 3' to the polymorphic nucleotide for each of four 
possible template strands as described above (the Sequences in Figure 3 are 
hypothetical and are not intended to represent any actual sequence). The sequence 

30 labeled M1 is SEQ ID NO:1; the sequence labeled M2 is SEQ ID NO:2; the sequence 
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labeled M3 is SEQ ID NO: 3; the sequence labeled M4 is SEQ ID NO:4; the sequence 
labeled F1 is SEQ ID NO: 5; the sequence labeled F2 is SEQ ID NO: 6; the sequence 
labeled F3 is SEQ ID NO: 7; and the sequence labeled F4 is SEQ ID NO: 8. While 
either strand may be used as a template for probe binding to determine heterozygosity 
5 and/or the loss thereof, the sequence of the probe that is hybridized to the template will 
be different depending upon the strand used. Probes may be of any length that allows 
efficient and specific hybridization. Figure 3 merely illustrates four hypothetical probes 
that are useful for hybridization to the hypothetical sequence shown. The length of 
probe sequences may be determined as appropriate for each genomic region that is 
10 analyzed. A preferable length is between about 10 and about 100 nucleotides. The 
size of the probe will also depend upon the size of the region surrounding the single- 
base polymorphism {i.e., the region 5' or 3' to the next adjacent polymorphism, if any). 
Details concerning the construction and hybridization of oligonucleotide probes are 
known in the art. 

1 5 Unique probes for each polymorphic region will hybridize to regions of both 

maternal and paternal alleles up to, but not including, the polymorphic nucleotide, 
which, in a heterozygote, will be different in the maternal and paternal alleles. Figure 3 
shows only a small portion of the region surrounding the polymorphic nucleotide. The 
alleles shown in Figure 3 are heterozygous at the polymorphic site. 
20 Probe is hybridized to its specific template DNA by standard methods. The 

sample may optionally be washed to remove unhybridized probe. To determine 
whether each target region bound by a probe is heterozygous or homozygous at the 
polymorphic nucleotide, a modification of the dideoxy chain termination method as 
reported in Sanger, Proc. Nat'lAcad. Sci. (USA), 74: 5463-5467 (1977). incorporated 
25 by reference herein, is used. The method involves using at least two of the four 

common 2', 3'-dideoxy nucleoside triphosphates (ddATP, ddCTP, ddGTP, and ddTTP). 
A different detectable label is attached to each dideoxy nucleoside triphosphate 
(ddNTP) according to methods known in the art. Differentially-labeled ddNTPs are 
available commercially, for example, from Perkin Elmer Corporation (Cat. No. 401456). 
At least two labeled ddNTPs then are exposed to each sample having probe hybridized 
to maternal and paternal alleles as described above. The choice of which two ddNTPs 
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are used will depend upon the nucleotides at the heterozygous polymorphic site. Any 
3' modified nucleoside triphosphate may be used in the method as long as the 3' 
modification prevents binding of an additional 3' nucleotide (i.e. probe extension) and 
does not inhibit binding of the modified nucleotide to the 3' end of the probe. A DNA 
5 polymerase, such as Sequenase™ (Perkin-Elmer), is added to the sample mixture. 
Using the allelic strands as primer, the polymerase will add one ddNTP to the 3' end of 
the probe, the incorporated ddNTP will be complementary to the nucleotide that exists 
at the single-base polymorphic site. Because the ddNTPs have no 3' hydroxyl, further 
elongation of the hybridized probe will not occur. After completion, the sample is 

10 washed to remove excess ddNTPs. Label is then counted in each sample. The 
presence of two differentially-labeled ddNTPs in a sample is indicative of 
heterozygosity at the polymorphic site. 

It is not necessary to determine the amount of each label present in the sample 
in order to establish heterozygosity or homozygosity. For Example, differentially- 

1 5 labeled deoxynucleoside triphosphates may be used for a determination of 

heterozygosity or homozygosity. The mere fact that two different labeled dideoxy 
nucleotides are incorporated into the probe means that the single-base polymorphic 
site being analyzed is heterozygous. However, determination of sites at which a 
patient is polymorphic is useful in order to establish a baseline of polymorphisms which 

20 may be used in future tests to detect changes in polymorphic sites which may be 

indicative of cancer. The existence of polymorphisms may be determined by methods 
taught herein, by gel electrophoresis or by other standard methods. 

In the case in which heterozygosity exists at the polymorphic site, counting the 
amount of each of the two differentially-labeled ddNTPs allows a determination of 

25 whether there is a loss of heterozygosity (/.e., a deletion) in a subpopulation of cells in 
the sample. In a normal (/.e. t non-cancerous) sample containing cells that are 
heterozygous at the single-base polymorphic site, it is expected that the detected 
amount of each of the two ddNTPs added to the probe will be identical (within chosen 
limits of statistical significance). However, if a deletion has occurred in one of the two 

30 alleles in a subpopulation of cells in the sample, there will be a statistically-significant 
difference between the amounts of each of the two alleles detected via the incorporated 
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(labeled) ddNTPs. The detection of such a difference is indicative of genomic 
instability within the sample. Such genomic instability indicates the possibility of 
cancerous or pre-cancerous cells in the sample. 

In order to improve the ability to count alleles to which ddNTPs have attached 
accurately, ddNTPs are labeled with hybridization-type beads of different sizes as 
described above. Alleles with bound probe comprising a labeled ddNTP are counted 
as described above using a counting device, such as a Coulter counter. Also as 
described above, differential fluorescent labels or other counting means may be used 
to separately detect incorporated ddNTPs. 

The detection of heterozygosity at single-base polymorphic sites and the 
detection of the loss of heterozygosity may be determined in separate steps. For 
example, probes may be hybridized immediately adjacent to but not including the 
nucleotide determined to be polymorphic as described above. The four ddNTPs may 
then be added to the sample, washed, and the presence or absence of each label may 
be detected. Detection of only one label indicates that the individual from whom the 
sample was obtained is homozygous at the site of the potential polymorphic nucleotide. 
Detection of two labels means that the individual is heterozygous. The heterozygous 
loci are recorded. As noted above, baseline determinations of heterozygosity may be 
done using standard methods. Once a baseline is established, future tests on that 
individual are performed exploiting the heterozygous loci in order to detect a loss of 
heterozygosity. For the detection of cancer, the heterozygous loci are typically 
chromosomal areas containing tumor suppressor genes, including p53, dec, ape, and 
others. Using methods described herein, a "fingerprint" of heterozygous tumor 
suppressor loci may be constructed. Future deviation from the fingerprint (i.e., 
deletions) provides valuable information as to the development of cancer. 

A preferred use of the foregoing methods is in the detection of colon cancer. A 
representative stool sample is prepared as described above. Double-stranded DNA is 
converted to single-stranded DNA and complement of the strand to be detected is 
removed from the sample. The remaining single-stranded DNA is exposed to multiple 
copies of a probe designed on the basis of known single-base polymorphisms in a 
cancer-associated allele such that the probe hybridizes with a desired number of 
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nucleotides immediately adjacent the polymorphic nucleotide as described above. 
After hybridization is complete, the sample is washed and exposed to differentially- 
labeled ddNTPs and a DNA polymerase. The sample then is washed to remove 
unincorporated ddNTPs. The presence of any labeled ddNTPs is determined. If two 
5 labels are detected, the individual from whom the sample is obtained is heterozygous 
at the polymorphic nucleotide. The heterozygosity of the allele and the probe 
sequence matching the site immediately adjacent to the polymorphic allele are noted 
for reference in future testing for the loss of heterozygosity. Alternatively, once the 
patient is determined to be heterozygous at a locus, an assay may be performed 
10 immediately in the manner described above in order to determine an existing loss of 
heterozygosity in a subpopulation of cells in the sample. 

C. Analysis of Microsatellite Instability 

Microsatellites are di- or trinucleotide repeats found throughout the genome. A 
particular array of microsatellite repeats is often associated with a particular genomic 
1 5 sequence and is stably inherited under normal conditions. Expansions of microsatellite 
copy number typically, called "microsatellite instability," are associated with defects in 
mismatch repair. Accordingly, changes in a microsatellite region indicate that the 
patient is at risk for a mutation in other genomic regions. 

In order to detect microsatellite instability as an indicator of a mutation in a 
20 cancer-associated gene, one must first identify a microsatellite region associated with 
the gene of interest. Such regions are typically identified on a database, such as 
GenBank, EMBL, and others. Once a wild-type microsatellite region associated with, 
for example, the p53 tumor suppressor gene, is identified, an oligonucleotide probe is 
constructed that spans the microsatellite region and the regions immediately 5' and 
25 immediately 3' to the microsatellite region. The precise length of probes may be 
determined by the experimenter. , Probes are constructed that hybridize to the 
microsatellite region, including portions extending 5* and 3\ on both the maternal and 
paternal alleles with which the microsatellite is associated (e.g., p53). 

An appropriate sample of body tissue or fluid is obtained and processed as 
30 described herein. Double stranded DNA is denatured and an excess of maternal and 
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paternal probes, as described above, are introduced into the sample under 
hybridization conditions. The probes are detectably labeled as described above. 
Complement of the strands to be detected may optionally be removed by methods 
described above. The sample is then washed to remove unhybridized probe and the 
5 amount of hybridized probe in quantitatively detected. 

Quantitative detection may be accomplished by any means described herein. 
For example, probes may be attached to hybridization beads such that probes that bind 
to maternal allele are attached to beads of one size and probes that bind to paternal 
allele are attached to beads of a second size that is distinguishable from beads of the 
10 first size. Beads with attached probe may be counted as described above. 

The detection of a statistically-significant difference between the amount of 
probe binding to the maternal allele and the amount of probe binding to the paternal 
allele is indicative of microsatellite instability. As previously mentioned, microsatellite 
instability can be indicative of a mutation at the locus in which the microsatellite 
1 5 resides. If the microsatellite region is associated with a tumor suppressor gene or an 
oncogene, the detection of microsatellite instability in an allele in a subpopulation of 
cells in a biological sample is indicative of the potential for cancer or that cancer or 
precancer may have already developed. Further testing as described herein (either by 
invasive or noninvasive means) may then be conducted. 

20 In an alternative embodiment, a "fingerprint" of microsatellites is taken from 

regions associated with cancer-causing genes in a sample obtained from a patient. 
Such a fingerprint may be obtained by standard methods. The fingerprint comprises 
the sequence of wild-type microsatellites associated with the cancer-causing gene or 
genes. Once obtained, the fingerprint is stored and is used in future tests of samples 

25 from the same patient in order to monitor changes in microsatellite regions (i.e. 
microsatellite instability) that may be associated with the development of cancer. 
Changes in microsatellite length and/or sequence over time may be used to prescribe 
additional testing and/or treatment in order to detect and remove cancerous tissue at 
an early stage in its etiology. 

30 Additional embodiments of the invention are apparent upon consideration of the 

following claims. 
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CLAIMS 

What is claimed is: 

11. A method of screening for the presence of a disease in a patient, the method 

2 comprising the steps of: 

3 a) obtaining a sample comprising a circumferential surface of a stool 

4 voided by a patient; and 

5 b) performing an assay to detect in the sample characteristics 

6 indicative of the presence of said disease. 

1 2. A method for screening for the presence of a colorectal cancer or precancerous 

2 lesion in a patient, the method comprising the steps of: 

3 a) obtaining a stool voided by a patient; 

4 b) removing a circumferential surface from said stool; and 

5 c) performing an assay on said surface to detect characteristics 

6 indicative of the presence of cells or cellular debris shed onto said 

7 stool. 

1 3. The method of claim 2, wherein said assay detects debris from a clonal 

2 population of transformed cells comprising said lesion. 

1 4. The method of claim 3, wherein said assay detects a protein expressed by the 

2 transformed cells indicative of the presence of said cells. 

1 5. The method of claim 3 t wherein said assay detects a DNA characteristic 

2 indicative of the presence of said cells. 

1 6. The method of claim 1 , further comprising the step of homogenizing said portion 

2 in a physiologically compatible buffer prior to step (b). 

1 7. The method of claim 6, wherein said physiologically compatible buffer comprises 

2 a detergent and a proteinase, 

1 8. The method of claim 1 , wherein said assay detects the presence of 

2 carcinoembryonic antigen shed from said lesion. 
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1 9. The method of claim 1 , wherein said assay comprises the step of exposing said 

2 sample to an antibody which specifically binds a molecule characteristic of the 

3 presence of said debris. 

1 1 0. The method of claim 5. wherein said DNA characteristic is a mutation. 

The method of claim 10, wherein said mutation is selected from the group 
consisting of loss of heterozygosity and microsatellite instability. 

1 1 2. The method of claim 1 0, wherein said characteristics comprise a deletion in a 

2 tumor suppressor allele. 

1 1 3. The method of claim 1 , wherein said assay comprises the step of determining 

2 whether a difference exists in said sample between a number X of a first allele 

3 known or suspected to be mutated in a subpopulation of cells in the sample and 

4 a number Y of a second allele that is known or suspected not to be mutated in a 

5 subpopulation of cells in the sample, the presence of a statistically-significant 

6 difference being indicative of a mutation in a subpopulation of cells in the 

7 sample and the potential presence of a cancerous or precancerous lesion. 
The method of claim 1, wherein said assay comprises the step of determining 
whether a difference exists between a number of a target tumor suppressor 
allele in the sample and a number of a non-cancer-associated reference allele in 
the sample, the presence of a statistically-significant difference being indicative 
of a deletion of the target tumor suppressor allele in a subpopulation of cells in 
the sample and the potential presence of a cancerous or precancerous lesion. 

1 15. The method of claim 5, wherein said characteristics comprise a loss of 

2 heterozygosity encompassing a polymorphic locus. 

1 16. The method of claim 1 3, wherein said assay comprises the steps of: 

2 a) detecting an amount of a maternal allele at a polymorphic locus in 

3 the biological sample; 

4 b) detecting an amount of a paternal allele at the polymorphic locus in 

5 the biological sample; and 
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6 c) determining whether a difference exists between the amounts of 

7 maternal and paternal allele, 

8 the presence of a statistically-significant difference being indicative of a deletion 

9 at the polymorphic locus in a subpopulation of cells in the biological sample and 
1 0 the potential presence of a lesion. 

1 1 7. The method of claim 1 , comprising the additional step of performing a visual 

2 examination of the colon of a patient presenting positive assay results. 

1 18. The method of claim 1 , wherein said at least cross-sectional portion comprises 

2 an entire stool voided by a patient. 

1 19. A method for reducing morbidity from colorectal cancer in a population, the 

2 method comprising the steps of: 

3 a) obtaining a sample comprising at least a representative portion of 

4 a stool voided by a patient; 

5 b) performing an assay to detect in the sample the presence of 

6 cellular debris shed from the lesion into the voided stool; and 

7 c) performing a visual examination of the colon of a patient presenting 

8 positive results in said assay to detect the presence of a lesion. 

1 20. The method of claim 1 9, wherein said assay detects a loss of DNA in a portion of 

2 debris shed from a clonal population of transformed epithelial cells comprising 

3 said lesion. 



1/4 




FIG. 1 



SUBSTITUTE SHEET (RULE26) 



2/4 




FIG. 2 



SUBSTITUTE SHEET (RULE26) 



3/4 




© ® V ® 



3 
4 



1 r i i i I I 



0 @©0©@® 

« " » «t 



® 0 0 0 



*5 




FIG. 3 



SUBSTITUTE SHEET (RULE26) 



3'ACGCTACGG5 
5'....ATCGGCTTACTGCGATGCC. 



3 ' ....TAGCCGAATGACGCTACGG. 
5'ATCGGCTTA3' 



3'acgctacgg5' 
5\..atcggcttattgcgatgcc... 



3\..TAGCCGAATAACGCTACGG.. 
5 ' ATCGGCTTA3 ' 



FIG. 4 



SUBSTITUTE SHEET (RULE26) 



WO 99/53316 . 

' - 1 

SEQUENCE LISTING 

<110> Lapidus, Stanley N 
Shuber, Anthony P 
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<120> Methods for Disease Diagnosis From Stool Samples 

<130> EXT-015PC 

<140> 
<141> 

<150> US 09/059,713 
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<160> 8 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 
Ml 

<400> 1 
ggcatcgca 

<210> 2 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 
M2 

<400> 2 

atcggcttac tgcgatgcc 

<210> 3 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 
M3 

<400> 3 

ggcatcgcag taagccgat 

<210> 4 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 
M4 

<400> 4 
atcggctta 

<210> 5 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 
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<400> 5 

ggcatcgca g 

<210> 6 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 
F2 

<400> 6 

atcggcttat tgcgatgcc ^9 

<210> 7 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 
<400> 7 

ggcatcgcaa taagccgat 19 

<210> 8 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 
F4 

<400> 8 

atcggctta 9 
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